Friday, 23 May 2025
Subscribe
logo
  • Global
  • AI
  • Cloud Computing
  • Edge Computing
  • Security
  • Investment
  • Sustainability
  • More
    • Colocation
    • Quantum Computing
    • Regulation & Policy
    • Infrastructure
    • Power & Cooling
    • Design
    • Innovations
    • Blog
Font ResizerAa
Data Center NewsData Center News
Search
  • Global
  • AI
  • Cloud Computing
  • Edge Computing
  • Security
  • Investment
  • Sustainability
  • More
    • Colocation
    • Quantum Computing
    • Regulation & Policy
    • Infrastructure
    • Power & Cooling
    • Design
    • Innovations
    • Blog
Have an existing account? Sign In
Follow US
© 2022 Foxiz News Network. Ruby Design Company. All Rights Reserved.
Data Center News > Blog > AI > Alibaba’s Qwen with Questions reasoning model beats o1-preview
AI

Alibaba’s Qwen with Questions reasoning model beats o1-preview

Last updated: November 29, 2024 5:58 pm
Published November 29, 2024
Share
Alibaba's Qwen with Questions reasoning model beats o1-preview
SHARE

Be a part of our day by day and weekly newsletters for the newest updates and unique content material on industry-leading AI protection. Be taught Extra


Chinese language e-commerce large Alibaba has launched the newest mannequin in its ever-expanding Qwen household. This one is named Qwen with Questions (QwQ), and serves as the newest open supply competitor to OpenAI’s o1 reasoning mannequin.

Like different massive reasoning fashions (LRMs), QwQ makes use of additional compute cycles throughout inference to assessment its solutions and proper its errors, making it extra appropriate for duties that require logical reasoning and planning like math and coding.

What’s Qwen with Questions (OwQ?) and might it’s used for industrial functions?

Alibaba has launched a 32-billion-parameter model of QwQ with a 32,000-token context. The mannequin is at the moment in preview, which implies a higher-performing model is prone to observe.

In response to Alibaba’s assessments, QwQ beats o1-preview on the AIME and MATH benchmarks, which consider mathematical problem-solving talents. It additionally outperforms o1-mini on GPQA, a benchmark for scientific reasoning. QwQ is inferior to o1 on the LiveCodeBench coding benchmarks however nonetheless outperforms different frontier fashions akin to GPT-4o and Claude 3.5 Sonnet.

Qwen with Questions
Instance output of Qwen with Questions

QwQ doesn’t include an accompanying paper that describes the information or the method used to coach the mannequin, which makes it troublesome to breed the mannequin’s outcomes. Nevertheless, for the reason that mannequin is open, in contrast to OpenAI o1, its “considering course of” isn’t hidden and can be utilized to make sense of how the mannequin causes when fixing issues.

See also  DeepMind framework offers breakthrough in LLMs’ reasoning

Alibaba has additionally launched the mannequin underneath an Apache 2.0 license, which implies it may be used for industrial functions.

‘We found one thing profound’

In response to a blog post that was printed together with the mannequin’s launch, “Via deep exploration and numerous trials, we found one thing profound: when given time to ponder, to query, and to replicate, the mannequin’s understanding of arithmetic and programming blossoms like a flower opening to the solar… This strategy of cautious reflection and self-questioning results in exceptional breakthroughs in fixing complicated issues.”

That is similar to what we find out about how reasoning fashions work. By producing extra tokens and reviewing their earlier responses, the fashions usually tend to right potential errors. Marco-o1, one other reasoning mannequin lately launched by Alibaba may additionally comprise hints of how QwQ is likely to be working. Marco-o1 makes use of Monte Carlo Tree Search (MCTS) and self-reflection at inference time to create completely different branches of reasoning and select the very best solutions. The mannequin was educated on a mix of chain-of-thought (CoT) examples and artificial knowledge generated with MCTS algorithms.

Alibaba factors out that QwQ nonetheless has limitations akin to mixing languages or getting caught in round reasoning loops. The mannequin is accessible for obtain on Hugging Face and a web-based demo may be discovered on Hugging Face Spaces.

The LLM age offers option to LRMs: Giant Reasoning Fashions

The discharge of o1 has triggered rising curiosity in creating LRMs, though not a lot is thought about how the mannequin works underneath the hood apart from utilizing inference-time scale to enhance the mannequin’s responses. 

See also  Breaking down Grok 3: The AI model that could redefine the industry

There at the moment are a number of Chinese language opponents to o1. Chinese language AI lab DeepSeek lately launched R1-Lite-Preview, its o1 competitor, which is at the moment solely out there by the corporate’s on-line chat interface. R1-Lite-Preview reportedly beats o1 on a number of key benchmarks.

One other lately launched mannequin is LLaVA-o1, developed by researchers from a number of universities in China, which brings the inference-time reasoning paradigm to open-source imaginative and prescient language fashions (VLMs). 

The give attention to LRMs comes at a time of uncertainty about the way forward for mannequin scaling legal guidelines. Reports point out that AI labs akin to OpenAI, Google DeepMind, and Anthropic are getting diminishing returns on coaching bigger fashions. And creating bigger volumes of high quality coaching knowledge is turning into more and more troublesome as fashions are already being educated on trillions of tokens gathered from the web. 

In the meantime, inference-time scale gives an alternate that may present the subsequent breakthrough in enhancing the talents of the subsequent technology of AI fashions. There are reviews that OpenAI is using o1 to generate synthetic reasoning data to coach the subsequent technology of its LLMs. The discharge of open reasoning fashions is prone to stimulate progress and make the area extra aggressive.


Source link
TAGGED: Alibabas, beats, Model, o1preview, Questions, Qwen, reasoning
Share This Article
Twitter Email Copy Link Print
Previous Article nLighten rebrands Euclyde to strengthen French edge data center presence nLighten rebrands Euclyde to strengthen French edge data center presence
Next Article Pepeto and Pepe Unchained Introduce zero fee trading and cross chain solutions vs layer 2 tech Pepeto and Pepe Unchained Introduce zero fee trading and cross chain solutions vs layer 2 tech
Leave a comment

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Your Trusted Source for Accurate and Timely Updates!

Our commitment to accuracy, impartiality, and delivering breaking news as it happens has earned us the trust of a vast audience. Stay ahead with real-time updates on the latest events, trends.
FacebookLike
TwitterFollow
InstagramFollow
YoutubeSubscribe
LinkedInFollow
MediumFollow
- Advertisement -
Ad image

Popular Posts

The risks of AI-generated code are real — here’s how enterprises can manage the risk

Be a part of our each day and weekly newsletters for the most recent updates…

March 16, 2025

Enabling high performance liquid cooling sytstems

For a few years, nVent has labored with cloud service suppliers on built-to-spec options for…

November 25, 2024

Oracle to Invest $6.5 Billion in Malaysia AI, Cloud Services Hub

(Bloomberg) -- Oracle Company plans to spend $6.5 billion constructing a cloud providers heart in…

October 3, 2024

FBI hacked thousands of computers to make malware uninstall itself

The FBI hacked about 4,200 computer systems throughout the US as a part of an…

January 14, 2025

UST Acquires Endeavor Consulting Group

UST, an Aliso Viejo, CA-based digital transformation options firm, acquired Endeavor Consulting Group, a Wayne, PA-based…

June 4, 2024

You Might Also Like

Details leak of Jony Ive's ambitious OpenAI device
AI

Details leak of Jony Ive’s ambitious OpenAI device

By saad
After GPT-4o backlash, researchers benchmark models on moral endorsement—Find sycophancy persists across the board
AI

After GPT-4o backlash, researchers benchmark models on moral endorsement—Find sycophancy persists across the board

By saad
A new era for intelligent agents and AI coding
AI

A new era for intelligent agents and AI coding

By saad
Enchant launches zero-equity accelerator for gaming and AI startups
AI

Enchant launches zero-equity accelerator for gaming and AI startups

By saad
Data Center News
Facebook Twitter Youtube Instagram Linkedin

About US

Data Center News: Stay informed on the pulse of data centers. Latest updates, tech trends, and industry insights—all in one place. Elevate your data infrastructure knowledge.

Top Categories
  • Global Market
  • Infrastructure
  • Innovations
  • Investments
Usefull Links
  • Home
  • Contact
  • Privacy Policy
  • Terms & Conditions

© 2024 – datacenternews.tech – All rights reserved

Welcome Back!

Sign in to your account

Lost your password?
We use cookies to ensure that we give you the best experience on our website. If you continue to use this site we will assume that you are happy with it.OkNoPrivacy policy
You can revoke your consent any time using the Revoke consent button.Revoke consent