Friday, 5 Dec 2025
Subscribe
logo
  • Global
  • AI
  • Cloud Computing
  • Edge Computing
  • Security
  • Investment
  • Sustainability
  • More
    • Colocation
    • Quantum Computing
    • Regulation & Policy
    • Infrastructure
    • Power & Cooling
    • Design
    • Innovations
    • Blog
Font ResizerAa
Data Center NewsData Center News
Search
  • Global
  • AI
  • Cloud Computing
  • Edge Computing
  • Security
  • Investment
  • Sustainability
  • More
    • Colocation
    • Quantum Computing
    • Regulation & Policy
    • Infrastructure
    • Power & Cooling
    • Design
    • Innovations
    • Blog
Have an existing account? Sign In
Follow US
© 2022 Foxiz News Network. Ruby Design Company. All Rights Reserved.
Data Center News > Blog > AI > Together AI’s $305M bet: Reasoning models like DeepSeek-R1 are increasing, not decreasing, GPU demand
AI

Together AI’s $305M bet: Reasoning models like DeepSeek-R1 are increasing, not decreasing, GPU demand

Last updated: February 21, 2025 1:52 pm
Published February 21, 2025
Share
Together AI's $305M bet: Reasoning models like DeepSeek-R1 are increasing, not decreasing, GPU demand
SHARE

Be a part of our every day and weekly newsletters for the newest updates and unique content material on industry-leading AI protection. Be taught Extra


When DeepSeek-R1 first emerged, the prevailing worry that shook the {industry} was that superior reasoning could possibly be achieved with much less infrastructure.

Because it seems, that’s not essentially the case. At the least, in keeping with Together AI, the rise of DeepSeek and open-source reasoning has had the precise reverse impact: As a substitute of lowering the necessity for infrastructure, it’s rising it.

That elevated demand has helped gas the expansion of Collectively AI’s platform and enterprise. Right this moment the corporate introduced a $305 million collection B spherical of funding, led by Common Catalyst and co-led by Prosperity7. Collectively AI first emerged in 2023 with an goal to simplify enterprise use of open-source massive language fashions (LLMs). The corporate expanded in 2024 with the Collectively enterprise platform, which permits AI deployment in digital non-public cloud (VPC) and on-premises environments. In 2025, Collectively AI is rising its platform as soon as once more with reasoning clusters and agentic AI capabilities. 

The corporate claims that its AI deployment platform has greater than 450,000 registered builders and that the enterprise has grown 6X general year-over-year. The corporate’s prospects embrace enterprises in addition to AI startups equivalent to  Krea AI, Captions and Pika Labs.

“We at the moment are serving fashions throughout all modalities: language and reasoning and pictures and audio and video,” Vipul Prakash, CEO of Collectively AI, instructed VentureBeat.

See also  Has Huawei outsmarted Apple in the AI race?

The massive impression DeepSeek-R1 is having on AI infrastructure demand

DeepSeek-R1 was vastly disruptive when it first debuted, for quite a lot of causes — one in every of which was the implication {that a} forefront open-source reasoning mannequin could possibly be constructed and deployed with much less infrastructure than a proprietary mannequin.

Nevertheless, Prakash defined, Collectively AI has grown its infrastructure partially to assist assist elevated demand of DeepSeek-R1 associated workloads.

“It’s a reasonably costly mannequin to run inference on,” he stated. “It has 671 billion parameters and it’s good to distribute it over a number of servers. And since the standard is increased, there’s typically extra demand on the highest finish, which suggests you want extra capability.”

Moreover, he famous that DeepSeek-R1 typically has longer-lived requests that may final two to a few minutes. Great consumer demand for DeepSeek-R1 is additional driving the necessity for extra infrastructure.

To fulfill that demand, Collectively AI has rolled out a service it calls “reasoning clusters” that provision devoted capability, starting from 128 to 2,000 chips, to run fashions at the absolute best efficiency.

How Collectively AI helps organizations use reasoning AI

There are a variety of particular areas the place Collectively AI is seeing utilization of reasoning fashions. These embrace:

  • Coding brokers: Reasoning fashions assist break down bigger issues into steps.
  • Decreasing hallucinations: The reasoning course of helps to confirm the outputs of fashions, thus lowering hallucinations, which is vital for purposes the place accuracy is essential.
  • Enhancing non-reasoning fashions: Clients are distilling and bettering the standard of non-reasoning fashions.
  • Enabling self-improvement: The usage of reinforcement studying with reasoning fashions permits fashions to recursively self-improve with out counting on massive quantities of human-labeled knowledge.
See also  AI startup Rep.ai raises $7.5M to launch 'digital twin' sales representatives

Agentic AI can be driving elevated demand for AI infrastructure 

Collectively AI can be seeing elevated infrastructure demand as its customers embrace agentic AI.

Prakash defined that agentic workflows, the place a single consumer request ends in 1000’s of API calls to finish a job, are placing extra compute demand on Collectively AI’s infrastructure.

To assist assist agentic AI workloads, Collectively AI just lately has acquired CodeSandbox, whose know-how offers light-weight, fast-booting digital machines (VMs) to execute arbitrary, safe code inside the Collectively AI cloud, the place the language fashions additionally reside. This permits Collectively AI to cut back the latency between the agentic code and the fashions that have to be referred to as, bettering the efficiency of agentic workflows.

Nvidia Blackwell is already having an impression

All AI platforms are going through elevated calls for. 

That’s one of many the reason why Nvidia retains rolling out new silicon that gives extra efficiency. Nvidia’s newest product chip is the Blackwell GPU, which is now being deployed at Collectively AI.

Prakash stated Nvidia Blackwell chips value round 25% greater than the earlier era, however present 2X the efficiency. The GB 200 platform with Blackwell chips is especially well-suited for coaching and inference of combination of knowledgeable (MoE) fashions, that are skilled throughout a number of InfiniBand-connected servers. He famous that Blackwell chips are additionally anticipated to supply an even bigger efficiency enhance for inference of bigger fashions, in comparison with smaller fashions.

The aggressive panorama of agentic AI

The market of AI infrastructure platforms is fiercely aggressive. 

See also  Amazon, Echoing Microsoft, Says It Can’t Keep Up With AI Demand

Collectively AI faces competitors from each established cloud suppliers and AI infrastructure startups. All of the hyperscalers, together with Microsoft, AWS and Google, have AI platforms. There’s additionally an rising class of AI-focussed gamers equivalent to Groq and Samba Nova which might be all aiming for a slice of the profitable market.

Collectively AI has a full-stack providing, together with GPU infrastructure with software program platform layers on high. This permits prospects to simply construct with open-source fashions or develop their very own fashions on the Collectively AI platform. The corporate additionally has a concentrate on analysis growing optimizations and accelerated runtimes for each inference and coaching.

“For example, we serve the DeepSeek-R1 mannequin at 85 tokens per second and Azure serves it at 7 tokens per second,” stated Prakash. “There’s a pretty widening hole within the efficiency and price that we are able to present to our prospects.”


Source link
TAGGED: 305M, AIs, Bet, decreasing, DeepSeekR1are, demand, GPU, increasing, models, reasoning
Share This Article
Twitter Email Copy Link Print
Previous Article Litmus advances AI at the edge with real-time DataOps and GPU acceleration Litmus advances AI at the edge with real-time DataOps and GPU acceleration
Next Article Karman+ Karman+ Raises $20M in Seed Funding
Leave a comment

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Your Trusted Source for Accurate and Timely Updates!

Our commitment to accuracy, impartiality, and delivering breaking news as it happens has earned us the trust of a vast audience. Stay ahead with real-time updates on the latest events, trends.
FacebookLike
TwitterFollow
InstagramFollow
YoutubeSubscribe
LinkedInFollow
MediumFollow
- Advertisement -
Ad image

Popular Posts

Sine Digital Raises $2.5M in Seed Funding

Sine Digital, a London, UK-based impartial efficiency advertising company, raised $2.5m in Seed funding. The funding got…

May 17, 2024

Plume Raises $20M in Series A Funding

Plume, a NYC-based supplier of an built-in modular Layer-1 blockchain targeted on Actual World Asset…

December 19, 2024

Mission Wealth Receives Strategic Investment from Great Hill Partners

Mission Wealth, a Santa Barbara, CA-based wealth administration agency, acquired an funding from Nice Hill Companions.…

May 4, 2025

Essential Measures for Business Continuity

From texting and streaming companies to vital authorities, schooling, and healthcare purposes, knowledge facilities allow…

July 29, 2024

The evolution of data center semiconductors: Navigating the AI revolution

The spine of worldwide cloud and AI infrastructure is present process a profound transformation, led…

August 18, 2025

You Might Also Like

Frontier AI agents replace chatbots
AI

Frontier AI agents replace chatbots

By saad
AWS launches Kiro powers with Stripe, Figma, and Datadog integrations for AI-assisted coding
AI

AWS launches Kiro powers with Stripe, Figma, and Datadog integrations for AI-assisted coding

By saad
AI Memory Hunger Forces Micron Consumer Exit
AI

AI Memory Hunger Forces Micron Consumer Exit

By saad
Gemini 3 Pro scores 69% trust in blinded testing up from 16% for Gemini 2.5: The case for evaluating AI on real-world trust, not academic benchmarks
AI

Gemini 3 Pro scores 69% trust in blinded testing up from 16% for Gemini 2.5: The case for evaluating AI on real-world trust, not academic benchmarks

By saad
Data Center News
Facebook Twitter Youtube Instagram Linkedin

About US

Data Center News: Stay informed on the pulse of data centers. Latest updates, tech trends, and industry insights—all in one place. Elevate your data infrastructure knowledge.

Top Categories
  • Global Market
  • Infrastructure
  • Innovations
  • Investments
Usefull Links
  • Home
  • Contact
  • Privacy Policy
  • Terms & Conditions

© 2024 – datacenternews.tech – All rights reserved

Welcome Back!

Sign in to your account

Lost your password?
We use cookies to ensure that we give you the best experience on our website. If you continue to use this site we will assume that you are happy with it.
You can revoke your consent any time using the Revoke consent button.