Saturday, 21 Mar 2026
Subscribe
logo
  • Global
  • AI
  • Cloud Computing
  • Edge Computing
  • Security
  • Investment
  • Sustainability
  • More
    • Colocation
    • Quantum Computing
    • Regulation & Policy
    • Infrastructure
    • Power & Cooling
    • Design
    • Innovations
    • Blog
Font ResizerAa
Data Center NewsData Center News
Search
  • Global
  • AI
  • Cloud Computing
  • Edge Computing
  • Security
  • Investment
  • Sustainability
  • More
    • Colocation
    • Quantum Computing
    • Regulation & Policy
    • Infrastructure
    • Power & Cooling
    • Design
    • Innovations
    • Blog
Have an existing account? Sign In
Follow US
© 2022 Foxiz News Network. Ruby Design Company. All Rights Reserved.
Data Center News > Blog > AI > Qwen 2.5-Max outperforms DeepSeek V3 in some benchmarks
AI

Qwen 2.5-Max outperforms DeepSeek V3 in some benchmarks

Last updated: January 29, 2025 10:47 am
Published January 29, 2025
Share
Two cyclists racing as the latest Qwen 2.5 AI model from Alibaba, Qwen 2.5-Max, outperforms competing artificial intelligence models such as DeepSeek V3 on several benchmarks.
SHARE

Alibaba’s response to DeepSeek is Qwen 2.5-Max, the corporate’s newest Combination-of-Specialists (MoE) large-scale mannequin.

Qwen 2.5-Max boasts pretraining on over 20 trillion tokens and fine-tuning by means of cutting-edge strategies like Supervised Advantageous-Tuning (SFT) and Reinforcement Studying from Human Suggestions (RLHF).

With the API now out there by means of Alibaba Cloud and the mannequin accessible for exploration through Qwen Chat, the Chinese language tech large is inviting builders and researchers to see its breakthroughs firsthand.

Outperforming friends  

When evaluating Qwen 2.5-Max’s efficiency towards a few of the most distinguished AI fashions on a wide range of benchmarks, the outcomes are promising.

Evaluations included fashionable metrics just like the MMLU-Professional for college-level problem-solving, LiveCodeBench for coding experience, LiveBench for general capabilities, and Enviornment-Arduous for assessing fashions towards human preferences.

In response to Alibaba, “Qwen 2.5-Max outperforms DeepSeek V3 in benchmarks corresponding to Enviornment-Arduous, LiveBench, LiveCodeBench, and GPQA-Diamond, whereas additionally demonstrating aggressive ends in different assessments, together with MMLU-Professional.”

AI benchmark comparison of Alibaba Qwen 2.5-Max against other artificial intelligence models such as DeepSeek V3.
(Credit score: Alibaba)

The instruct mannequin – designed for downstream duties like chat and coding – competes instantly with main fashions corresponding to GPT-4o, Claude-3.5-Sonnet, and DeepSeek V3. Amongst these, Qwen 2.5-Max managed to outperform rivals in a number of key areas.

Comparisons of base fashions additionally yielded promising outcomes. Whereas proprietary fashions like GPT-4o and Claude-3.5-Sonnet remained out of attain as a result of entry restrictions, Qwen 2.5-Max was assessed towards main public choices corresponding to DeepSeek V3, Llama-3.1-405B (the most important open-weight dense mannequin), and Qwen2.5-72B. Once more, Alibaba’s newcomer demonstrated distinctive efficiency throughout the board.

See also  OpenAI drops Deep Research access to Plus users, heating up AI agent wars with DeepSeek and Claude

“Our base fashions have demonstrated important benefits throughout most benchmarks,” Alibaba said, “and we’re optimistic that developments in post-training strategies will elevate the following model of Qwen 2.5-Max to new heights.”

The burst of DeepSeek V3 has attracted consideration from the entire AI neighborhood to large-scale MoE fashions. Concurrently, now we have been constructing Qwen2.5-Max, a big MoE LLM pretrained on huge knowledge and post-trained with curated SFT and RLHF recipes. It achieves aggressive… pic.twitter.com/oHVl16vfje

— Qwen (@Alibaba_Qwen) January 28, 2025

Making Qwen 2.5-Max accessible  

To make the mannequin extra accessible to the worldwide neighborhood, Alibaba has built-in Qwen 2.5-Max with its Qwen Chat platform, the place customers can work together instantly with the mannequin in numerous capacities—whether or not exploring its search capabilities or testing its understanding of advanced queries.  

For builders, the Qwen 2.5-Max API is now out there by means of Alibaba Cloud below the mannequin identify “qwen-max-2025-01-25”. customers can get began by registering an Alibaba Cloud account, activating the Mannequin Studio service, and producing an API key.  

The API is even appropriate with OpenAI’s ecosystem, making integration simple for current initiatives and workflows. This compatibility lowers the barrier for these keen to check their purposes with the mannequin’s capabilities.

Alibaba has made a powerful assertion of intent with Qwen 2.5-Max. The corporate’s ongoing dedication to scaling AI fashions isn’t just about enhancing efficiency benchmarks but in addition about enhancing the basic considering and reasoning skills of those methods.  

“The scaling of information and mannequin dimension not solely showcases developments in mannequin intelligence but in addition displays our unwavering dedication to pioneering analysis,” Alibaba famous.  

See also  A Stytch in time: Connected Apps untangles authorization tie-ups for AI agents

Trying forward, the workforce goals to push the boundaries of reinforcement studying to foster much more superior reasoning abilities. This, they are saying, might allow their fashions to not solely match however surpass human intelligence in fixing intricate issues.  

The implications for the trade might be profound. As scaling strategies enhance and Qwen fashions break new floor, we’re more likely to see additional ripples throughout AI-driven fields globally that we’ve seen in current weeks.

(Picture by Maico Amorim)

See additionally: ChatGPT Gov goals to modernise US authorities companies

Need to study extra about AI and large knowledge from trade leaders? Take a look at AI & Big Data Expo happening in Amsterdam, California, and London. The great occasion is co-located with different main occasions together with Intelligent Automation Conference, BlockX, Digital Transformation Week, and Cyber Security & Cloud Expo.

Discover different upcoming enterprise expertise occasions and webinars powered by TechForge here.

Tags: ai, alibaba, synthetic intelligence, fashions, qwen, qwen 2.5



Source link

TAGGED: 2.5Max, benchmarks, DeepSeek, outperforms, Qwen
Share This Article
Twitter Email Copy Link Print
Previous Article Atomicwork Atomicwork Raises $25M in Series A Funding
Next Article Clarametyx Biosciences Receives Investment From Kineticos AMR Accelerator Fund Clarametyx Biosciences Receives Investment From Kineticos AMR Accelerator Fund
Leave a comment

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Your Trusted Source for Accurate and Timely Updates!

Our commitment to accuracy, impartiality, and delivering breaking news as it happens has earned us the trust of a vast audience. Stay ahead with real-time updates on the latest events, trends.
FacebookLike
TwitterFollow
InstagramFollow
YoutubeSubscribe
LinkedInFollow
MediumFollow
- Advertisement -
Ad image

Popular Posts

Liquid Cooling solutions for data centres expands Park Place Technologies’ IT infrastructure offerings

Park Place Applied sciences is increasing ts portfolio of IT infrastructure providers with the introduction…

September 24, 2024

‘Smarter, faster and safer.’ Why many workplaces are embracing virtual reality

Credit score: Pixabay/CC0 Public Area Image this: You're employed in a warehouse and have to…

August 8, 2025

Prokeep Raises $25M in Series A Funding

Prokeep, a New Orleans, LA-based supplier of a buyer communication and engagement platform for distributors,…

November 13, 2024

Cato Networks acquires AI security startup Aim Security

“Whereas the world met the early indicators of this revolution with deep cynicism and tried…

September 6, 2025

Circulate Health Raises $12M in Seed Funding

Circulate Health, a Novato, CA-based longevity startup which focuses on therapeutic plasma change (TPE) to…

July 6, 2025

You Might Also Like

NVIDIA Agent Toolkit Gives Enterprises a Framework to Deploy AI Agents at Scale
AI

NVIDIA Agent Toolkit Gives Enterprises a Framework to Deploy AI Agents at Scale

By saad
Visa prepares payment systems for AI agent-initiated transactions
AI

Visa prepares payment systems for AI agent-initiated transactions

By saad
For effective AI, insurance needs to get its data house in order
AI

For effective AI, insurance needs to get its data house in order

By saad
Mastercard keeps tabs on fraud with new foundation model
AI

Mastercard keeps tabs on fraud with new foundation model

By saad
Data Center News
Facebook Twitter Youtube Instagram Linkedin

About US

Data Center News: Stay informed on the pulse of data centers. Latest updates, tech trends, and industry insights—all in one place. Elevate your data infrastructure knowledge.

Top Categories
  • Global Market
  • Infrastructure
  • Innovations
  • Investments
Usefull Links
  • Home
  • Contact
  • Privacy Policy
  • Terms & Conditions

© 2024 – datacenternews.tech – All rights reserved

Welcome Back!

Sign in to your account

Lost your password?
We use cookies to ensure that we give you the best experience on our website. If you continue to use this site we will assume that you are happy with it.
You can revoke your consent any time using the Revoke consent button.