
Nvidia claims 10x cost savings with open-source inference models

Last updated: February 15, 2026 8:35 am
Published February 15, 2026

Nvidia noted that cost per token went from 20 cents on the older Hopper platform to 10 cents on Blackwell. Moving to Blackwell’s native low-precision NVFP4 format further reduced the cost to just 5 cents, so the upgrade delivered a 4x improvement in cost per token while maintaining the accuracy that customers expect.
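The arithmetic behind the 4x figure can be checked with a short sketch. The function name `improvement_factor` and the variable names are illustrative assumptions, not anything published by Nvidia; the cent values are the ones quoted above.

```python
def improvement_factor(old_cost: float, new_cost: float) -> float:
    """Return how many times cheaper new_cost is than old_cost."""
    if new_cost <= 0:
        raise ValueError("new_cost must be positive")
    return old_cost / new_cost


# Cost figures in cents, as quoted in the article.
hopper = 20.0
blackwell = 10.0
blackwell_nvfp4 = 5.0

# Gain from the hardware move alone (Hopper to Blackwell).
hardware_gain = improvement_factor(hopper, blackwell)

# Additional gain from switching to the NVFP4 format on Blackwell.
format_gain = improvement_factor(blackwell, blackwell_nvfp4)

# Combined gain, which equals the product of the two steps.
total_gain = improvement_factor(hopper, blackwell_nvfp4)

print(f"hardware: {hardware_gain}x, format: {format_gain}x, total: {total_gain}x")
```

Each step halves the cost, so the two 2x gains multiply to the 4x total.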

Nvidia outlined four industry deployments in a blog post showing how this combination of Blackwell infrastructure, NVFP4, optimized software stacks and open-source models delivers significant cost reductions. They break down like this:

  • Healthcare: Tedious, time-consuming tasks like medical coding, documentation and managing insurance forms cut into the time doctors can spend with patients. Sully.ai helps address this problem by using AI agents to handle the routine tasks that take up that time.

The problem is that Sully.ai’s proprietary, closed source models didn’t scale well. So Sully.ai used Baseten’s open-source Model API on Blackwell GPUs with the NVFP4 data format, the TensorRT-LLM library and the Dynamo inference framework. The result was a 90% drop in inference costs, representing a 10x reduction compared with the prior closed source implementation, while response times improved by 65% for critical workflows like generating medical notes.
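A 90% drop and a 10x reduction are the same claim stated two ways, which a minimal sketch makes explicit. The helper name `reduction_to_factor` is a hypothetical chosen for illustration; only the 90% figure comes from the article.

```python
def reduction_to_factor(percent_drop: float) -> float:
    """Convert a percentage cost drop into an 'N times cheaper' factor."""
    if not 0 <= percent_drop < 100:
        raise ValueError("percent_drop must be in the range [0, 100)")
    # After the drop, (100 - percent_drop)% of the original cost remains.
    return 100.0 / (100.0 - percent_drop)


# Reported drop in inference costs for the Sully.ai deployment.
cost_factor = reduction_to_factor(90)

print(f"A 90% cost drop means costs are {cost_factor}x lower")
```

The 65% response-time figure is left out of the calculation, since the article does not say whether it describes a drop in latency or a gain in speed, and the two would convert to different factors.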
