Galileo launched Agentic Evaluation to fix AI agent errors before it costs you


Among our daily and weekly newsletters for the latest updates and exclusive content in the main AI coverage industry. Learn more


GalileoA Startup based in San Francisco, betting that the future of artificial intelligence depends on trust. Now, the company launches a new product, Agency evaluationsIn order to answer the growing challenge to AI world: ensure that more complex systems known as AI agents actually act according to purpose.

AI agents – Autonomous systems that make multiple-step tasks such as making reports or analysis of customer data – gain traction in industries. But their fast adoption produces important question: How companies determine that these systems remain reliable after deployment? Galileo’s CEO Vikram Chatterji, believed that his company found the answer.

“Within the past six to eight months, we began to see some of our customers trying to adopt agent systems,” Chatterji said in an interview. “Today LLMs can be used as a wise router to select and select the correct API calls to complete a task. Only to make a task until completing a task is large abbreviation of being opened.

A diagram showing how Galileo evaluates AI agents in three important stages: choosing the tool, analyzing the error and completeness of the task. (Credit: Galileo)

AI agents showed promise, but businesses demanded liability

The big businesses like Cisco and EMA (The last built by the former Chief Product Officer of the Coinbase) has already adopted Galileo’s platform. These companies use AI agents to automate tasks from customer support to financial analysis, and report important products obtained.

“A sales representative trying to do outreach and outbounds may take a week at their time, compared to some of these agents with AI, they do that within two days or not yet , “Chatterji explained, promoting. the return of investment for businesses.

Galileo’s new framework evaluates the quality of the choice of tool, noticed the errors of tool calls, and tracked the overall success in the session. It also monitors important metrics for the AI’s deployment, including costs and latencies.

A dashboard showing how Galileo evaluates AI agents in three important stages: Choosing tool, analysis of error and completeness of task. (Credit: Galileo)

$ 68 million funds inspire Galileo to business ai

The launch built Galileo’s recent momentum. The company has been raised $ 45 million in series B funds led by Scale Venture Partners In October, it brought the total fund to $ 68 million. Industry analyst plans the market for AI operating tools can reach $ 4 billion in 2025.

The stakes are high while AI deployment is fast. DELIVERY DISTRIBUTED EVERYONE advanced models like GPT-4 can be hallucinate About 23% of the weather during basic question-and-answer tasks. Galileo’s tools can help businesses recognize these issues before it affects operations.

“Before we launch this item, we really need to know that this thing works,” says Chatterji, which depicts customer concerns. “The bar is very tall. So we give them this tool chain, so they can only use our metrics as basic for these tests. “

Solve AI Hallucinations and Enterprise-scale challenges

The company’s focus on reliable, prepared production solutions put it well in a market that is more concerned with AI safety. For Technical Leaders Deed Business AI, Galileo’s platform provides important guardrails to ensure AI agents act according to the purpose while controlling costs.

As the businesses extend their use of AI agents, performance monitoring tools have become an important infrastructure. Galileo’s latest celebrity refers to help with AI’s deployment businesses responsible and effective on a scale.

“2025 is the year of agents. It will be greater,” said Chatterji. “However, we see too are many companies that have been a good attempt carrying on negative implications …



Source link
  • Related Posts

    Elon Musk’s Hatred of Sam Altman Causes Conflict in Trump World

    Elon Musk, who is the richest man in the world, got himself an unelected position at the right hand of the President of the United States, complete with the West…

    Ai Chat Firm Gameon Couple is arrested for Destroying Investors over $ 60M to fund marriage and other

    Authorities arrested yesterday at the founder of Gameon Technology Alexander Charles Beckman and the lawyer Valerie Lau Beckman (a couple) due to the alleged struggle of investors of $ 60…

    Leave a Reply

    Your email address will not be published. Required fields are marked *