OpenAI’s o1 model doesn’t show its thinking, giving open source an advantage

Last updated: December 11, 2024 2:14 am
Published December 11, 2024

OpenAI has ushered in a new reasoning paradigm in large language models (LLMs) with its o1 model, which recently received a major upgrade. However, while OpenAI has a strong lead in reasoning models, it might lose some ground to open source rivals that are quickly emerging.

Models like o1, sometimes referred to as large reasoning models (LRMs), use extra inference-time compute cycles to “think” more, review their responses and correct their answers. This enables them to solve complex reasoning problems that classic LLMs struggle with and makes them especially useful for tasks such as coding, math and data analysis.

However, in recent days, developers have shown mixed reactions to o1, especially after the updated release. Some have posted examples of o1 accomplishing incredible tasks while others have expressed frustration over the model’s confusing responses. Developers have experienced all kinds of problems, from illogical changes to code to ignored instructions.

Secrecy around o1 details

Part of the confusion is due to OpenAI’s secrecy and refusal to show the details of how o1 works. The secret sauce behind the success of LRMs is the extra tokens that the model generates as it reaches the final response, referred to as the model’s “thoughts” or “reasoning chain.” For example, if you prompt a classic LLM to generate code for a task, it will immediately generate the code. In contrast, an LRM will generate reasoning tokens that examine the problem, plan the structure of the code, and generate multiple solutions before emitting the final answer.

o1 hides the thinking process and only shows the final response along with a message that displays how long the model thought and possibly a high-level overview of the reasoning process. This is partly to avoid cluttering the response and to provide a smoother user experience. But more importantly, OpenAI considers the reasoning chain a trade secret and wants to make it difficult for competitors to replicate o1’s capabilities.

The costs of training new models continue to grow and profit margins are not keeping pace, which is pushing some AI labs to become more secretive in order to extend their lead. Even Apollo Research, which did the red-teaming of the model, was not given access to its reasoning chain.

This lack of transparency has led users to make all kinds of speculations, including accusing OpenAI of degrading the model to cut inference costs.

Open-source models fully transparent

On the other hand, open source alternatives such as Alibaba’s Qwen with Questions (QwQ) and Marco-o1 show the full reasoning chain of their models. Another alternative is DeepSeek R1, which isn’t open source but still reveals the reasoning tokens. Seeing the reasoning chain enables developers to troubleshoot their prompts and find ways to improve the model’s responses by adding more instructions or in-context examples.
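
When a model exposes its reasoning tokens, a developer can separate them from the final answer and inspect them. The sketch below assumes, purely for illustration, that the model wraps its reasoning in <think>...</think> tags. The delimiter and the split_reasoning helper are hypothetical, and actual output formats differ between models.

```python
import re
from typing import NamedTuple


class ParsedOutput(NamedTuple):
    reasoning: str  # the model's visible "thoughts"
    answer: str     # what remains after the thoughts are removed


# Assumed delimiter for illustration only; check your model's actual format.
THINK_BLOCK = re.compile(r"<think>(.*?)</think>", re.DOTALL)


def split_reasoning(raw: str) -> ParsedOutput:
    """Separate reasoning blocks from the final answer in raw model output."""
    thoughts = [block.strip() for block in THINK_BLOCK.findall(raw)]
    answer = THINK_BLOCK.sub("", raw).strip()
    return ParsedOutput(reasoning="\n".join(thoughts), answer=answer)


# Hand-written sample string, not real model output.
sample = "<think>The task asks for a sum. 2 + 2 = 4.</think>The answer is 4."
parsed = split_reasoning(sample)
print("Reasoning:", parsed.reasoning)
print("Answer:", parsed.answer)
```

Logging the reasoning part next to each prompt is one way to spot where an instruction was misread before changing the prompt.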

Visibility into the reasoning process is especially important when you want to integrate the model’s responses into applications and tools that expect consistent results. Moreover, having control over the underlying model is important in enterprise applications. Private models and the scaffolding that supports them, such as the safeguards and filters that test their inputs and outputs, are constantly changing. While this may result in better overall performance, it can break many prompts and applications that were built on top of them. In contrast, open source models give full control of the model to the developer, which can be a more robust option for enterprise applications, where performance on very specific tasks is more important than general skills.
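
One way to notice when a model update has broken an existing prompt is to rerun a fixed set of prompts and check each response for the text the application depends on. The following is a minimal sketch under stated assumptions: find_regressions, model_v1, model_v2 and CASES are all hypothetical, and the two stub functions stand in for a hosted model before and after an update rather than calling any real API.

```python
from typing import Callable, Dict, List


def find_regressions(model: Callable[[str], str], cases: Dict[str, str]) -> List[str]:
    """Return the prompts whose response no longer contains the expected text."""
    failures = []
    for prompt, expected in cases.items():
        response = model(prompt)
        if expected.lower() not in response.lower():
            failures.append(prompt)
    return failures


# Hypothetical stand-ins for a model before and after a provider-side change.
def model_v1(prompt: str) -> str:
    return '{"status": "ok"}' if "JSON" in prompt else "Paris"


def model_v2(prompt: str) -> str:
    # The updated stub answers in prose and drops the JSON format.
    return "Sure! Here is the result: ok" if "JSON" in prompt else "Paris"


# Each prompt maps to a substring the application relies on.
CASES = {
    "Reply in JSON with a status field.": '"status"',
    "What is the capital of France? One word.": "Paris",
}

before = find_regressions(model_v1, CASES)
after = find_regressions(model_v2, CASES)
print("Failures before update:", before)
print("Failures after update:", after)
```

With a self-hosted open source model, the same check can be run before deciding whether to adopt a new version, since the developer controls when the model changes.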

QwQ and R1 are still in preview versions and o1 has the lead in terms of accuracy and ease of use. And for many uses, such as making general ad hoc prompts and one-time requests, o1 can still be a better option than the open source alternatives.

But the open-source community is quick to catch up with private models and we can expect more models to emerge in the coming months. They can become a suitable alternative where visibility and control are crucial.

