Sunday, 1 Mar 2026
Subscribe
logo
  • Global
  • AI
  • Cloud Computing
  • Edge Computing
  • Security
  • Investment
  • Sustainability
  • More
    • Colocation
    • Quantum Computing
    • Regulation & Policy
    • Infrastructure
    • Power & Cooling
    • Design
    • Innovations
    • Blog
Font ResizerAa
Data Center NewsData Center News
Search
  • Global
  • AI
  • Cloud Computing
  • Edge Computing
  • Security
  • Investment
  • Sustainability
  • More
    • Colocation
    • Quantum Computing
    • Regulation & Policy
    • Infrastructure
    • Power & Cooling
    • Design
    • Innovations
    • Blog
Have an existing account? Sign In
Follow US
© 2022 Foxiz News Network. Ruby Design Company. All Rights Reserved.
Data Center News > Blog > AI > Qwen 2.5-Max outperforms DeepSeek V3 in some benchmarks
AI

Qwen 2.5-Max outperforms DeepSeek V3 in some benchmarks

Last updated: January 29, 2025 10:47 am
Published January 29, 2025
Share
Two cyclists racing as the latest Qwen 2.5 AI model from Alibaba, Qwen 2.5-Max, outperforms competing artificial intelligence models such as DeepSeek V3 on several benchmarks.
SHARE

Alibaba’s response to DeepSeek is Qwen 2.5-Max, the corporate’s newest Combination-of-Specialists (MoE) large-scale mannequin.

Qwen 2.5-Max boasts pretraining on over 20 trillion tokens and fine-tuning by means of cutting-edge strategies like Supervised Advantageous-Tuning (SFT) and Reinforcement Studying from Human Suggestions (RLHF).

With the API now out there by means of Alibaba Cloud and the mannequin accessible for exploration through Qwen Chat, the Chinese language tech large is inviting builders and researchers to see its breakthroughs firsthand.

Outperforming friends  

When evaluating Qwen 2.5-Max’s efficiency towards a few of the most distinguished AI fashions on a wide range of benchmarks, the outcomes are promising.

Evaluations included fashionable metrics just like the MMLU-Professional for college-level problem-solving, LiveCodeBench for coding experience, LiveBench for general capabilities, and Enviornment-Arduous for assessing fashions towards human preferences.

In response to Alibaba, “Qwen 2.5-Max outperforms DeepSeek V3 in benchmarks corresponding to Enviornment-Arduous, LiveBench, LiveCodeBench, and GPQA-Diamond, whereas additionally demonstrating aggressive ends in different assessments, together with MMLU-Professional.”

AI benchmark comparison of Alibaba Qwen 2.5-Max against other artificial intelligence models such as DeepSeek V3.
(Credit score: Alibaba)

The instruct mannequin – designed for downstream duties like chat and coding – competes instantly with main fashions corresponding to GPT-4o, Claude-3.5-Sonnet, and DeepSeek V3. Amongst these, Qwen 2.5-Max managed to outperform rivals in a number of key areas.

Comparisons of base fashions additionally yielded promising outcomes. Whereas proprietary fashions like GPT-4o and Claude-3.5-Sonnet remained out of attain as a result of entry restrictions, Qwen 2.5-Max was assessed towards main public choices corresponding to DeepSeek V3, Llama-3.1-405B (the most important open-weight dense mannequin), and Qwen2.5-72B. Once more, Alibaba’s newcomer demonstrated distinctive efficiency throughout the board.

See also  DeepSeek reverts to Nvidia for R2 model after Huawei AI chip fails

“Our base fashions have demonstrated important benefits throughout most benchmarks,” Alibaba said, “and we’re optimistic that developments in post-training strategies will elevate the following model of Qwen 2.5-Max to new heights.”

The burst of DeepSeek V3 has attracted consideration from the entire AI neighborhood to large-scale MoE fashions. Concurrently, now we have been constructing Qwen2.5-Max, a big MoE LLM pretrained on huge knowledge and post-trained with curated SFT and RLHF recipes. It achieves aggressive… pic.twitter.com/oHVl16vfje

— Qwen (@Alibaba_Qwen) January 28, 2025

Making Qwen 2.5-Max accessible  

To make the mannequin extra accessible to the worldwide neighborhood, Alibaba has built-in Qwen 2.5-Max with its Qwen Chat platform, the place customers can work together instantly with the mannequin in numerous capacities—whether or not exploring its search capabilities or testing its understanding of advanced queries.  

For builders, the Qwen 2.5-Max API is now out there by means of Alibaba Cloud below the mannequin identify “qwen-max-2025-01-25”. customers can get began by registering an Alibaba Cloud account, activating the Mannequin Studio service, and producing an API key.  

The API is even appropriate with OpenAI’s ecosystem, making integration simple for current initiatives and workflows. This compatibility lowers the barrier for these keen to check their purposes with the mannequin’s capabilities.

Alibaba has made a powerful assertion of intent with Qwen 2.5-Max. The corporate’s ongoing dedication to scaling AI fashions isn’t just about enhancing efficiency benchmarks but in addition about enhancing the basic considering and reasoning skills of those methods.  

“The scaling of information and mannequin dimension not solely showcases developments in mannequin intelligence but in addition displays our unwavering dedication to pioneering analysis,” Alibaba famous.  

See also  Nvidia's strong Q2 results can't mask the ASIC challenge in their future

Trying forward, the workforce goals to push the boundaries of reinforcement studying to foster much more superior reasoning abilities. This, they are saying, might allow their fashions to not solely match however surpass human intelligence in fixing intricate issues.  

The implications for the trade might be profound. As scaling strategies enhance and Qwen fashions break new floor, we’re more likely to see additional ripples throughout AI-driven fields globally that we’ve seen in current weeks.

(Picture by Maico Amorim)

See additionally: ChatGPT Gov goals to modernise US authorities companies

Need to study extra about AI and large knowledge from trade leaders? Take a look at AI & Big Data Expo happening in Amsterdam, California, and London. The great occasion is co-located with different main occasions together with Intelligent Automation Conference, BlockX, Digital Transformation Week, and Cyber Security & Cloud Expo.

Discover different upcoming enterprise expertise occasions and webinars powered by TechForge here.

Tags: ai, alibaba, synthetic intelligence, fashions, qwen, qwen 2.5



Source link

TAGGED: 2.5Max, benchmarks, DeepSeek, outperforms, Qwen
Share This Article
Twitter Email Copy Link Print
Previous Article Atomicwork Atomicwork Raises $25M in Series A Funding
Next Article Clarametyx Biosciences Receives Investment From Kineticos AMR Accelerator Fund Clarametyx Biosciences Receives Investment From Kineticos AMR Accelerator Fund
Leave a comment

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Your Trusted Source for Accurate and Timely Updates!

Our commitment to accuracy, impartiality, and delivering breaking news as it happens has earned us the trust of a vast audience. Stay ahead with real-time updates on the latest events, trends.
FacebookLike
TwitterFollow
InstagramFollow
YoutubeSubscribe
LinkedInFollow
MediumFollow
- Advertisement -
Ad image

Popular Posts

Workorb Raises Seed Funding

Workorb, a Toronto, Canada-based startup serving to corporations within the Structure, Engineering, and Development (AEC)…

November 6, 2024

AI deployments to push data centre physical infrastructure market over $50 billion

This raised outlook is because of the rising expectations of accelerated computing, which requires investments…

August 9, 2024

3 killer apps for cloud-based generative AI

I’ve been working with synthetic intelligence methods for the reason that Eighties. Again then, AI…

February 14, 2024

Mixtral 8x22B sets new benchmark for open models

Mistral AI has launched Mixtral 8x22B, which units a brand new benchmark for open supply…

April 18, 2024

Softloans Raises €1M in Pre-Seed Funding

Softloans, a Vilnius, Lithuania-based fintech startup, raised €1M in Pre-Seed funding. The spherical was led…

April 20, 2024

You Might Also Like

ASML's high-NA EUV tools clear the runway for next-gen AI chips
AI

ASML’s high-NA EUV tools clear the runway for next-gen AI chips

By saad
Poor implementation of AI may be behind workforce reduction
AI

Poor implementation of AI may be behind workforce reduction

By saad
Upgrading agentic AI for finance workflows
AI

Upgrading agentic AI for finance workflows

By saad
Goldman Sachs and Deutsche Bank test agentic AI for trade surveillance
AI

Goldman Sachs and Deutsche Bank test agentic AI in trading

By saad
Data Center News
Facebook Twitter Youtube Instagram Linkedin

About US

Data Center News: Stay informed on the pulse of data centers. Latest updates, tech trends, and industry insights—all in one place. Elevate your data infrastructure knowledge.

Top Categories
  • Global Market
  • Infrastructure
  • Innovations
  • Investments
Usefull Links
  • Home
  • Contact
  • Privacy Policy
  • Terms & Conditions

© 2024 – datacenternews.tech – All rights reserved

Welcome Back!

Sign in to your account

Lost your password?
We use cookies to ensure that we give you the best experience on our website. If you continue to use this site we will assume that you are happy with it.
You can revoke your consent any time using the Revoke consent button.