
Anthropic to Google: Who’s winning against AI hallucinations?

Last updated: July 29, 2024 8:07 pm
Published July 29, 2024

Galileo, a leading developer of generative AI for enterprise applications, has released its latest Hallucination Index.

The evaluation framework, which focuses on Retrieval Augmented Generation (RAG), assessed 22 prominent Gen AI LLMs from major players including OpenAI, Anthropic, Google, and Meta. This year’s index expanded significantly, adding 11 new models to reflect the rapid growth in both open- and closed-source LLMs over the past eight months.

Vikram Chatterji, CEO and Co-founder of Galileo, said: “In today’s rapidly evolving AI landscape, developers and enterprises face a critical challenge: how to harness the power of generative AI while balancing cost, accuracy, and reliability. Current benchmarks are often based on academic use-cases, rather than real-world applications.”

The index employed Galileo’s proprietary evaluation metric, context adherence, to check for output inaccuracies across various input lengths, ranging from 1,000 to 100,000 tokens. This approach aims to help enterprises make informed decisions about balancing price and performance in their AI implementations.

Key findings from the index include:

  • Anthropic’s Claude 3.5 Sonnet emerged as the best overall performing model, consistently scoring near-perfect across short, medium, and long context scenarios.
  • Google’s Gemini 1.5 Flash ranked as the best performing model in terms of cost-effectiveness, delivering strong performance across all tasks.
  • Alibaba’s Qwen2-72B-Instruct stood out as the top open-source model, particularly excelling in short and medium context scenarios.

The index also highlighted several trends in the LLM landscape:

  • Open-source models are rapidly closing the gap with their closed-source counterparts, offering improved hallucination performance at lower costs.
  • Current RAG LLMs demonstrate significant improvements in handling extended context lengths without sacrificing quality or accuracy.
  • Smaller models sometimes outperform larger ones, suggesting that efficient design can be more important than scale.
  • The emergence of strong performers from outside the US, such as Mistral’s Mistral-large and Alibaba’s Qwen2-72B-Instruct, indicates growing global competition in LLM development.

While closed-source models like Claude 3.5 Sonnet and Gemini 1.5 Flash maintain their lead due to proprietary training data, the index reveals that the landscape is evolving rapidly. Google’s performance was particularly noteworthy, with its open-source Gemma-7b model performing poorly while its closed-source Gemini 1.5 Flash consistently ranked near the top.

As the AI industry continues to grapple with hallucinations as a major hurdle to production-ready Gen AI products, Galileo’s Hallucination Index provides valuable insights for enterprises looking to adopt the right model for their specific needs and budget constraints.

See also: Senators probe OpenAI on safety and employment practices

Want to learn more about AI and big data from industry leaders? Check out AI & Big Data Expo taking place in Amsterdam, California, and London. The comprehensive event is co-located with other leading events including Intelligent Automation Conference, BlockX, Digital Transformation Week, and Cyber Security & Cloud Expo.


The post Anthropic to Google: Who’s winning against AI hallucinations? appeared first on AI News.
