Friday, 10 Apr 2026
Subscribe
logo
  • Global
  • AI
  • Cloud Computing
  • Edge Computing
  • Security
  • Investment
  • Sustainability
  • More
    • Colocation
    • Quantum Computing
    • Regulation & Policy
    • Infrastructure
    • Power & Cooling
    • Design
    • Innovations
    • Blog
Font ResizerAa
Data Center NewsData Center News
Search
  • Global
  • AI
  • Cloud Computing
  • Edge Computing
  • Security
  • Investment
  • Sustainability
  • More
    • Colocation
    • Quantum Computing
    • Regulation & Policy
    • Infrastructure
    • Power & Cooling
    • Design
    • Innovations
    • Blog
Have an existing account? Sign In
Follow US
© 2022 Foxiz News Network. Ruby Design Company. All Rights Reserved.
Data Center News > Blog > AI > Anthropic to Google: Who’s winning against AI hallucinations?
AI

Anthropic to Google: Who’s winning against AI hallucinations?

Last updated: July 29, 2024 8:07 pm
Published July 29, 2024
Share
Anthropic to Google: Who’s winning against AI hallucinations?
SHARE

Galileo, a number one developer of generative AI for enterprise functions, has launched its newest Hallucination Index.

The analysis framework – which focuses on Retrieval Augmented Technology (RAG) – assessed 22 outstanding Gen AI LLMs from main gamers together with OpenAI, Anthropic, Google, and Meta. This yr’s index expanded considerably, including 11 new fashions to mirror the fast development in each open- and closed-source LLMs over the previous eight months.

Vikram Chatterji, CEO and Co-founder of Galileo, mentioned: “In at the moment’s quickly evolving AI panorama, builders and enterprises face a vital problem: methods to harness the facility of generative AI whereas balancing value, accuracy, and reliability. Present benchmarks are sometimes based mostly on tutorial use-cases, somewhat than real-world functions.”

The index employed Galileo’s proprietary analysis metric, context adherence, to verify for output inaccuracies throughout varied enter lengths, starting from 1,000 to 100,000 tokens. This method goals to assist enterprises make knowledgeable choices about balancing worth and efficiency of their AI implementations.

Key findings from the index embrace:

  • Anthropic’s Claude 3.5 Sonnet emerged as one of the best total performing mannequin, persistently scoring near-perfect throughout quick, medium, and lengthy context eventualities.
  • Google’s Gemini 1.5 Flash ranked as one of the best performing mannequin by way of cost-effectiveness, delivering robust efficiency throughout all duties.
  • Alibaba’s Qwen2-72B-Instruct stood out as the highest open-source mannequin, significantly excelling in brief and medium context eventualities.

The index additionally highlighted a number of traits within the LLM panorama:

  • Open-source fashions are quickly closing the hole with their closed-source counterparts, providing improved hallucination efficiency at decrease prices.
  • Present RAG LLMs reveal important enhancements in dealing with prolonged context lengths with out sacrificing high quality or accuracy.
  • Smaller fashions generally outperform bigger ones, suggesting that environment friendly design will be extra essential than scale.
  • The emergence of robust performers from outdoors the US, similar to Mistral’s Mistral-large and Alibaba’s qwen2-72b-instruct, signifies a rising international competitors in LLM growth.
See also  Google DeepMind unveils protein design system

Whereas closed-source fashions like Claude 3.5 Sonnet and Gemini 1.5 Flash preserve their lead attributable to proprietary coaching information, the index reveals that the panorama is evolving quickly. Google’s efficiency was significantly noteworthy, with its open-source Gemma-7b mannequin performing poorly whereas its closed-source Gemini 1.5 Flash persistently ranked close to the highest.

Because the AI trade continues to grapple with hallucinations as a serious hurdle to production-ready Gen AI merchandise, Galileo’s Hallucination Index gives precious insights for enterprises seeking to undertake the proper mannequin for his or her particular wants and finances constraints.

See additionally: Senators probe OpenAI on security and employment practices

Wish to be taught extra about AI and large information from trade leaders? Take a look at AI & Big Data Expo happening in Amsterdam, California, and London. The great occasion is co-located with different main occasions together with Intelligent Automation Conference, BlockX, Digital Transformation Week, and Cyber Security & Cloud Expo.

Discover different upcoming enterprise expertise occasions and webinars powered by TechForge here.

The publish Anthropic to Google: Who’s successful towards AI hallucinations? appeared first on AI Information.

Source link

TAGGED: Anthropic, Google, hallucinations, Whos, winning
Share This Article
Twitter Email Copy Link Print
Previous Article Ceramic-based Storage Firm Secures Investment from Pure Storage Ceramic-based Storage Firm Secures Investment from Pure Storage
Next Article mergers and acquisitions ACA Group Buys Encore Compliance
Leave a comment

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Your Trusted Source for Accurate and Timely Updates!

Our commitment to accuracy, impartiality, and delivering breaking news as it happens has earned us the trust of a vast audience. Stay ahead with real-time updates on the latest events, trends.
FacebookLike
TwitterFollow
InstagramFollow
YoutubeSubscribe
LinkedInFollow
MediumFollow
- Advertisement -
Ad image

Popular Posts

NetEase to shut down public cloud service

NetEase is discontinuing one among its public cloud providers as competitors in China’s cloud computing…

March 11, 2025

#Memhash Now Available on Exchanges After Successful Mining Phase

Kingstown, St. Vincent and the Grenadines, February twenty eighth, 2025, Chainwire   Recognised as a…

March 2, 2025

Nuitée Raises $48M in Series A Funding

Nuitée, a Dublin, Eire-based journey tech infrastructure startup, raised $48M in Collection A funding. The spherical…

December 19, 2024

OpenAI's GPT-5.2 is here: what enterprises need to know

The rumors had been true: OpenAI on Thursday introduced the discharge of its new frontier…

December 12, 2025

Gaw Capital to Expand Data Center Portfolio in Japan | DCN

(Bloomberg) -- Hong Kong non-public fairness actual property agency Gaw Capital Companions has acquired a…

May 28, 2024

You Might Also Like

Why companies like Apple are building AI agents with limits
AI

Why companies like Apple are building AI agents with limits

By saad
Germany only - Google erweitert Gemini-Portfolio mit kosteneffizienten Modellen
Global Market

Google owns the most AI compute, and it built it its way

By saad
Agentic AI's governance challenges under the EU AI Act in 2026
AI

Agentic AI’s governance challenges under the EU AI Act in 2026

By saad
Anthropic keeps new AI model private after it finds thousands of external vulnerabilities
AI

Anthropic keeps new AI model private after it finds thousands of external vulnerabilities

By saad
Data Center News
Facebook Twitter Youtube Instagram Linkedin

About US

Data Center News: Stay informed on the pulse of data centers. Latest updates, tech trends, and industry insights—all in one place. Elevate your data infrastructure knowledge.

Top Categories
  • Global Market
  • Infrastructure
  • Innovations
  • Investments
Usefull Links
  • Home
  • Contact
  • Privacy Policy
  • Terms & Conditions

© 2024 – datacenternews.tech – All rights reserved

Welcome Back!

Sign in to your account

Lost your password?
We use cookies to ensure that we give you the best experience on our website. If you continue to use this site we will assume that you are happy with it.
You can revoke your consent any time using the Revoke consent button.