Thursday, 30 Apr 2026
Subscribe
logo
  • AI Compute
  • Infrastructure
  • Power & Cooling
  • Security
  • Colocation
  • Cloud Computing
  • More
    • Sustainability
    • Industry News
    • About Data Center News
    • Terms & Conditions
Font ResizerAa
Data Center NewsData Center News
Search
  • AI Compute
  • Infrastructure
  • Power & Cooling
  • Security
  • Colocation
  • Cloud Computing
  • More
    • Sustainability
    • Industry News
    • About Data Center News
    • Terms & Conditions
Have an existing account? Sign In
Follow US
© 2022 Foxiz News Network. Ruby Design Company. All Rights Reserved.
Data Center News > Blog > AI & Compute > Qwen 2.5-Max outperforms DeepSeek V3 in some benchmarks
AI & Compute

Qwen 2.5-Max outperforms DeepSeek V3 in some benchmarks

Last updated: January 29, 2025 10:47 am
Published January 29, 2025
Share
Two cyclists racing as the latest Qwen 2.5 AI model from Alibaba, Qwen 2.5-Max, outperforms competing artificial intelligence models such as DeepSeek V3 on several benchmarks.
SHARE

Alibaba’s response to DeepSeek is Qwen 2.5-Max, the corporate’s newest Combination-of-Specialists (MoE) large-scale mannequin.

Qwen 2.5-Max boasts pretraining on over 20 trillion tokens and fine-tuning by means of cutting-edge strategies like Supervised Advantageous-Tuning (SFT) and Reinforcement Studying from Human Suggestions (RLHF).

With the API now out there by means of Alibaba Cloud and the mannequin accessible for exploration through Qwen Chat, the Chinese language tech large is inviting builders and researchers to see its breakthroughs firsthand.

Outperforming friends  

When evaluating Qwen 2.5-Max’s efficiency towards a few of the most distinguished AI fashions on a wide range of benchmarks, the outcomes are promising.

Evaluations included fashionable metrics just like the MMLU-Professional for college-level problem-solving, LiveCodeBench for coding experience, LiveBench for general capabilities, and Enviornment-Arduous for assessing fashions towards human preferences.

In response to Alibaba, “Qwen 2.5-Max outperforms DeepSeek V3 in benchmarks corresponding to Enviornment-Arduous, LiveBench, LiveCodeBench, and GPQA-Diamond, whereas additionally demonstrating aggressive ends in different assessments, together with MMLU-Professional.”

AI benchmark comparison of Alibaba Qwen 2.5-Max against other artificial intelligence models such as DeepSeek V3.
(Credit score: Alibaba)

The instruct mannequin – designed for downstream duties like chat and coding – competes instantly with main fashions corresponding to GPT-4o, Claude-3.5-Sonnet, and DeepSeek V3. Amongst these, Qwen 2.5-Max managed to outperform rivals in a number of key areas.

Comparisons of base fashions additionally yielded promising outcomes. Whereas proprietary fashions like GPT-4o and Claude-3.5-Sonnet remained out of attain as a result of entry restrictions, Qwen 2.5-Max was assessed towards main public choices corresponding to DeepSeek V3, Llama-3.1-405B (the most important open-weight dense mannequin), and Qwen2.5-72B. Once more, Alibaba’s newcomer demonstrated distinctive efficiency throughout the board.

See also  What Rollup News says about battling disinformation

“Our base fashions have demonstrated important benefits throughout most benchmarks,” Alibaba said, “and we’re optimistic that developments in post-training strategies will elevate the following model of Qwen 2.5-Max to new heights.”

The burst of DeepSeek V3 has attracted consideration from the entire AI neighborhood to large-scale MoE fashions. Concurrently, now we have been constructing Qwen2.5-Max, a big MoE LLM pretrained on huge knowledge and post-trained with curated SFT and RLHF recipes. It achieves aggressive… pic.twitter.com/oHVl16vfje

— Qwen (@Alibaba_Qwen) January 28, 2025

Making Qwen 2.5-Max accessible  

To make the mannequin extra accessible to the worldwide neighborhood, Alibaba has built-in Qwen 2.5-Max with its Qwen Chat platform, the place customers can work together instantly with the mannequin in numerous capacities—whether or not exploring its search capabilities or testing its understanding of advanced queries.  

For builders, the Qwen 2.5-Max API is now out there by means of Alibaba Cloud below the mannequin identify “qwen-max-2025-01-25”. customers can get began by registering an Alibaba Cloud account, activating the Mannequin Studio service, and producing an API key.  

The API is even appropriate with OpenAI’s ecosystem, making integration simple for current initiatives and workflows. This compatibility lowers the barrier for these keen to check their purposes with the mannequin’s capabilities.

Alibaba has made a powerful assertion of intent with Qwen 2.5-Max. The corporate’s ongoing dedication to scaling AI fashions isn’t just about enhancing efficiency benchmarks but in addition about enhancing the basic considering and reasoning skills of those methods.  

“The scaling of information and mannequin dimension not solely showcases developments in mannequin intelligence but in addition displays our unwavering dedication to pioneering analysis,” Alibaba famous.  

See also  AMD unveils 5th Gen Epyc embedded processors for networking, storage and industrial edge

Trying forward, the workforce goals to push the boundaries of reinforcement studying to foster much more superior reasoning abilities. This, they are saying, might allow their fashions to not solely match however surpass human intelligence in fixing intricate issues.  

The implications for the trade might be profound. As scaling strategies enhance and Qwen fashions break new floor, we’re more likely to see additional ripples throughout AI-driven fields globally that we’ve seen in current weeks.

(Picture by Maico Amorim)

See additionally: ChatGPT Gov goals to modernise US authorities companies

Need to study extra about AI and large knowledge from trade leaders? Take a look at AI & Big Data Expo happening in Amsterdam, California, and London. The great occasion is co-located with different main occasions together with Intelligent Automation Conference, BlockX, Digital Transformation Week, and Cyber Security & Cloud Expo.

Discover different upcoming enterprise expertise occasions and webinars powered by TechForge here.

Tags: ai, alibaba, synthetic intelligence, fashions, qwen, qwen 2.5



Source link

TAGGED: 2.5Max, benchmarks, DeepSeek, outperforms, Qwen
Share This Article
Twitter Email Copy Link Print
Previous Article Alibaba’s Qwen2.5-Max challenges U.S. tech giants, reshapes enterprise AI Alibaba’s Qwen2.5-Max challenges U.S. tech giants, reshapes enterprise AI
Next Article Why organisations must radically evolve their disaster recovery strategies in 2025 to stay resilient Why organisations must radically evolve their disaster recovery strategies in 2025 to stay resilient
Leave a comment

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Your Trusted Source for Accurate and Timely Updates!

Our commitment to accuracy, impartiality, and delivering breaking news as it happens has earned us the trust of a vast audience. Stay ahead with real-time updates on the latest events, trends.
FacebookLike
TwitterFollow
InstagramFollow
YoutubeSubscribe
LinkedInFollow
MediumFollow
- Advertisement -
Ad image

Popular Posts

OpenAI, Nvidia to Announce UK Data Center Investments

(Bloomberg) -- The leaders of OpenAI and Nvidia Company plan to pledge help for billions…

September 24, 2025

Amazon’s Emissions Climbed 6% in 2024 on Data Center Buildout

(Bloomberg) -- Amazon’s carbon emissions rose for the primary time in three years in 2024,…

July 17, 2025

Can the grid cope with AI’s growing appetite?

Because the AI Power Council gathers, the query hanging within the air is: how will…

June 30, 2025

Why scaling intelligent automation requires financial rigour

Greg Holmes, Area CTO for EMEA at Apptio, an IBM firm, argues that efficiently scaling…

February 3, 2026

The TAO of data: How Databricks is optimizing  AI LLM fine-tuning without data labels

Be a part of our each day and weekly newsletters for the most recent updates…

March 28, 2025

You Might Also Like

STL launches Neuralis data centre connectivity suite in the U.S.
AI & Compute

STL launches Neuralis data centre connectivity suite in the U.S.

By saad
What is optical interconnect and why Lightelligence's $10B debut says it matters for AI
AI & Compute

What is optical interconnect and why Lightelligence’s $10B debut says it matters for AI

By saad
IBM launches AI platform Bob to regulate SDLC costs
AI & Compute

IBM launches AI platform Bob to regulate SDLC costs

By saad
The evolution of encoders: From simple models to multimodal AI
AI & Compute

The evolution of encoders: From simple models to multimodal AI

By saad

About Us

Data Center News is your dedicated source for data center infrastructure, AI compute, cloud, and industry news.

Top Categories

  • AI & Compute
  • Cloud Computing
  • Power & Cooling
  • Colocation
  • Security
  • Infrastructure
  • Sustainability
  • Industry News

Useful Links

  • Home
  • Contact
  • Privacy Policy
  • Terms & Conditions

Find Us on Socials

© 2026 Data Center News. All Rights Reserved.

© 2026 Data Center News. All Rights Reserved.
Welcome Back!

Sign in to your account

Lost your password?
We use cookies to ensure that we give you the best experience on our website. If you continue to use this site we will assume that you are happy with it.
You can revoke your consent any time using the Revoke consent button.