Saturday, 28 Feb 2026

ByteDance releases new open source Seed-OSS-36B model

Last updated: August 21, 2025 4:20 pm
Published August 21, 2025


TikTok is making headlines again today after the White House joined the popular social media application, but its parent company ByteDance, a Chinese web giant, also had a surprise announcement up its sleeve.

The company’s Seed Team of AI researchers today released Seed-OSS-36B on AI code sharing website Hugging Face.

Seed-OSS-36B is a new line of open source large language models (LLMs) designed for advanced reasoning and developer-focused usability, with a longer token context (that is, how much information the models can accept as inputs and then output in a single exchange) than many competing LLMs from U.S. tech companies, even leaders such as OpenAI and Anthropic.

The collection introduces three main variants:



  • Seed-OSS-36B-Base with synthetic data
  • Seed-OSS-36B-Base without synthetic data
  • Seed-OSS-36B-Instruct

In releasing both synthetic and non-synthetic versions of the Seed-OSS-36B-Base model, the Seed Team sought to balance practical performance with research flexibility.

The synthetic-data variant, trained with additional instruction data, consistently delivers stronger scores on standard benchmarks and is intended as a higher-performing general-purpose option.

The non-synthetic model, by contrast, omits these augmentations, creating a cleaner foundation that avoids potential bias or distortion introduced by synthetic instruction data.

By providing both, the team gives applied users access to improved results while ensuring researchers retain a neutral baseline for studying post-training methods.

Meanwhile, the Seed-OSS-36B-Instruct model differs in that it is post-trained with instruction data to prioritize task execution and instruction following, rather than serving purely as a foundation model.

All three models are released under the Apache-2.0 license, allowing free use, modification, and redistribution by researchers and developers working for enterprises.

That means they can be used to power commercial applications, internal to a company or external/customer-facing, without paying ByteDance any licensing fees or for application programming interface (API) usage.

This continues the summer 2025 trend of Chinese companies shipping powerful open source models, with OpenAI trying to catch up with its own open source gpt-oss duet released earlier this month.

The Seed Team positions Seed-OSS for international applications, emphasizing versatility across reasoning, agent-like task execution, and multilingual settings.

The Seed Team, formed in 2023, has concentrated on building foundation models that can serve both research and applied use cases.

Design and core features

The architecture behind Seed-OSS-36B combines familiar design choices such as causal language modeling, grouped query attention, SwiGLU activation, RMSNorm, and RoPE positional encoding.
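
For readers unfamiliar with one of the components named above, here is a minimal sketch of the textbook RMSNorm operation in plain Python. It is not taken from the Seed-OSS code base; the function name `rms_norm`, the `gain` argument, and the `eps` default are assumptions chosen for illustration.

```python
import math
from typing import Sequence


def rms_norm(x: Sequence[float], gain: Sequence[float], eps: float = 1e-6) -> list[float]:
    """Generic RMSNorm: scale x by the reciprocal of its root mean square.

    Illustrative only; `gain` stands in for the learned per-feature weight,
    and `eps` is an assumed small constant for numerical stability.
    """
    if len(x) == 0 or len(x) != len(gain):
        raise ValueError("x and gain must be non-empty and the same length")
    mean_square = sum(v * v for v in x) / len(x)
    scale = 1.0 / math.sqrt(mean_square + eps)
    return [v * scale * g for v, g in zip(x, gain)]


if __name__ == "__main__":
    # With unit gain, the output has a root mean square of roughly 1.
    print(rms_norm([3.0, 4.0], [1.0, 1.0]))
```

Unlike LayerNorm, this form does not subtract the mean, which is one reason it is a common choice in recent LLM architectures.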

Each model carries 36 billion parameters across 64 layers and supports a vocabulary of 155,000 tokens.

One of the defining features is its native long-context capability, with a maximum length of 512,000 tokens, designed to process extended documents and reasoning chains without performance loss.

That’s twice the length of OpenAI’s new GPT-5 model family and is roughly equivalent to about 1,600 pages of text, the length of a Christian Bible.

Another distinguishing element is the introduction of a thinking budget, which lets developers specify how much reasoning the model should perform before delivering an answer.

It’s something we’ve seen from other recent open source models as well, including Nvidia’s new Nemotron-Nano-9B-v2, also available on Hugging Face.

In practice, this means teams can tune performance depending on the complexity of the task and the efficiency requirements of deployment.

Budgets are recommended in multiples of 512 tokens, with 0 providing a direct response mode.
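
As a rough illustration of the budget guidance above, the sketch below rounds a requested budget up to the nearest multiple of 512 and passes 0 through unchanged. The helper `normalize_thinking_budget` is hypothetical and not part of any Seed-OSS API; how the resulting value is actually passed to the model is not covered here.

```python
STEP = 512  # budget granularity recommended in the article


def normalize_thinking_budget(requested_tokens: int, step: int = STEP) -> int:
    """Round a requested reasoning budget up to the nearest multiple of `step`.

    Hypothetical helper for illustration. A value of 0 is returned as-is,
    since the article describes 0 as the direct-response mode.
    """
    if requested_tokens < 0:
        raise ValueError("budget must be non-negative")
    if requested_tokens == 0:
        return 0
    # Ceiling division, then scale back up to a multiple of `step`.
    return -(-requested_tokens // step) * step


if __name__ == "__main__":
    for requested in (0, 1, 512, 700, 2000):
        print(requested, "->", normalize_thinking_budget(requested))
```

Rounding up rather than down is a design choice made here so that a caller never receives less reasoning room than requested.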

Competitive performance on third-party benchmarks

Benchmarks published with the release place Seed-OSS-36B among the stronger large open-source models. The Instruct variant, in particular, posts state-of-the-art results in multiple areas.

  • Math and reasoning: Seed-OSS-36B-Instruct achieves 91.7 percent on AIME24 and 65 on BeyondAIME, both representing open-source “state-of-the-art” (SOTA).
  • Coding: On LiveCodeBench v6, the Instruct model records 67.4, another SOTA score.
  • Long-context handling: On RULER at 128K context length, it reaches 94.6, marking the highest open-source result reported.
  • Base model performance: The synthetic-data Base variant delivers 65.1 on MMLU-Pro and 81.7 on MATH, both state-of-the-art results in their categories.

The no-synthetic Base model, while slightly behind on many measures, proves competitive in its own right.

It outperforms its synthetic counterpart on GPQA-D, providing researchers with a cleaner, instruction-free baseline for experimentation.

For enterprises comparing open options, these results suggest Seed-OSS offers strong potential across math-heavy, coding, and long-context workloads while still providing flexibility for research use cases.

Access and deployment

Beyond performance, the Seed Team highlights accessibility for developers and practitioners. The models can be deployed using Hugging Face Transformers, with quantization support in both 4-bit and 8-bit formats to reduce memory requirements.
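
To give a sense of why quantization matters at this scale, the back-of-the-envelope sketch below estimates weight storage for 36 billion parameters at different precisions. It counts weights only and assumes decimal gigabytes; the KV cache, activations, and quantization metadata are ignored, so real memory use will be higher. The function name `weight_memory_gb` is made up for this example.

```python
PARAMS = 36_000_000_000  # parameter count reported in the article


def weight_memory_gb(num_params: int, bits_per_param: int) -> float:
    """Approximate weight storage in decimal gigabytes (1 GB = 1e9 bytes).

    Weights only: KV cache, activations, and quantization overhead
    are deliberately left out of this estimate.
    """
    return num_params * bits_per_param / 8 / 1e9


if __name__ == "__main__":
    for bits in (16, 8, 4):
        print(f"{bits}-bit weights: ~{weight_memory_gb(PARAMS, bits):.0f} GB")
```

Under these assumptions, 16-bit weights come to about 72 GB, 8-bit to about 36 GB, and 4-bit to about 18 GB.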

They also integrate with vLLM for scalable serving, including configuration examples and API server instructions.

To lower barriers further, the team includes scripts for inference, prompt customization, and tool integration.

For technical leaders managing small teams or working under budget constraints, these provisions are positioned to make experimentation with 36-billion-parameter models more approachable.

Licensing and considerations for enterprise decision-makers

With the models offered under Apache-2.0, organizations can adopt them without restrictive licensing terms, an important factor for teams balancing legal and operational concerns.

For decision makers evaluating the open-source landscape, the release brings three takeaways:

  • State-of-the-art benchmarks across math, coding, and long-context reasoning.
  • A balance between higher-performing synthetic-trained models and clean research baselines.
  • Accessibility features that lower operational overhead for lean engineering teams.

By placing strong performance and flexible deployment under an open license, ByteDance’s Seed Team has added new options for enterprises, researchers, and developers alike.

