
Nvidia and DataStax just made generative AI smarter and leaner — here’s how

Last updated: December 26, 2024 5:08 pm
Published December 26, 2024


Nvidia and DataStax launched new technology today that dramatically reduces storage requirements for companies deploying generative AI systems, while enabling faster and more accurate information retrieval across multiple languages.

The new Nvidia NeMo Retriever microservices, integrated with DataStax’s AI platform, cut data storage volume by 35 times compared to traditional approaches, a crucial capability as enterprise data is projected to reach more than 20 zettabytes by 2027.

“Today’s enterprise unstructured data is at 11 zettabytes, roughly equal to 800,000 copies of the Library of Congress, and 83% of that is unstructured with 50% being audio and video,” said Kari Briski, VP of product management for AI at Nvidia, in an interview with VentureBeat. “Significantly reducing these storage costs while enabling companies to effectively embed and retrieve information becomes a game changer.”

Nvidia’s NeMo Retriever technology delivers a 35x improvement in data storage efficiency, as illustrated in a comparison of raw text storage, baseline vector embeddings, and reduced embedding dimensions. This breakthrough underpins the scalability of generative AI across enterprise applications. (Credit: Nvidia)
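To make the comparison between baseline and reduced embeddings concrete, the sketch below works through the storage arithmetic for a vector index. The article does not say how the 35x figure breaks down, so every number here (vector count, dimensions, bytes per value) is an illustrative assumption, not Nvidia's configuration, and the resulting factor is not meant to match 35x.

```python
def embedding_storage_bytes(num_vectors: int, dims: int, bytes_per_value: int) -> int:
    """Raw size of an embedding table, ignoring index and metadata overhead."""
    return num_vectors * dims * bytes_per_value


# Hypothetical corpus size, chosen only for illustration.
NUM_VECTORS = 10_000_000

# Assumed baseline: 2048-dimensional float32 embeddings (4 bytes per value).
baseline = embedding_storage_bytes(NUM_VECTORS, dims=2048, bytes_per_value=4)

# Assumed reduced form: 512 dimensions stored as int8 (1 byte per value).
reduced = embedding_storage_bytes(NUM_VECTORS, dims=512, bytes_per_value=1)

reduction_factor = baseline / reduced

print(f"baseline: {baseline / 1e9:.2f} GB")
print(f"reduced:  {reduced / 1e9:.2f} GB")
print(f"reduction factor: {reduction_factor:.0f}x")
```

Under these assumptions, cutting dimensions by 4x and bytes per value by 4x compounds to a 16x smaller table, which shows why the two levers together can move storage by an order of magnitude.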

The technology is already proving transformative for the Wikimedia Foundation, which used the integrated solution to reduce processing time for 10 million Wikipedia entries from 30 days to under three days. The system handles real-time updates across hundreds of thousands of entries being edited daily by 24,000 global volunteers.

“You can’t just rely on large language models for content; you need context from your existing enterprise data,” explained Chet Kapoor, CEO of DataStax. “This is where our hybrid search capability comes in, combining both semantic search and traditional text search, then using Nvidia’s re-ranker technology to deliver the most relevant results in real time at global scale.”
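One common way to combine a semantic ranking with a keyword ranking is reciprocal rank fusion (RRF). The article does not describe how DataStax merges the two result lists, so the sketch below is a generic illustration and not the company's implementation; the function name, the document IDs, and the constant k=60 are all assumptions. The re-ranking stage Kapoor mentions would then rescore the fused candidates and is omitted here.

```python
from collections import defaultdict


def reciprocal_rank_fusion(rankings, k=60):
    """Fuse several ranked lists of document IDs into one list.

    Each document scores 1 / (k + rank) per list it appears in;
    ties are broken alphabetically so the output is deterministic.
    """
    scores = defaultdict(float)
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] += 1.0 / (k + rank)
    return sorted(scores, key=lambda doc_id: (-scores[doc_id], doc_id))


# Hypothetical results from a vector (semantic) search and a text search.
semantic_hits = ["doc_b", "doc_a", "doc_d"]
keyword_hits = ["doc_a", "doc_c", "doc_b"]

fused = reciprocal_rank_fusion([semantic_hits, keyword_hits])
print(fused)
```

Documents that appear near the top of both lists rise to the front of the fused list, while documents found by only one retriever are kept as lower-ranked candidates rather than dropped.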


Enterprise data security meets AI accessibility

The partnership addresses a critical challenge facing enterprises: how to make their vast stores of private data accessible to AI systems without exposing sensitive information to external language models.

“Take FedEx: 60% of their data sits in our products, including all package delivery information for the past 20 years with personal details. That’s not going to Gemini or OpenAI anytime soon, or ever,” Kapoor explained.

The technology is finding early adoption across industries, with financial services firms leading the charge despite regulatory constraints. “I’ve been blown away by how far ahead financial services firms are now,” said Kapoor, citing Commonwealth Bank of Australia and Capital One as examples.

The next frontier for AI: Multimodal document processing

Looking ahead, Nvidia plans to expand the technology’s capabilities to handle more complex document formats. “We’re seeing great results with multimodal PDF processing: understanding tables, graphs, charts and images and how they relate across pages,” Briski revealed. “It’s a really hard problem that we’re excited to tackle.”

For enterprises drowning in unstructured data while trying to deploy AI responsibly, the new offering provides a path to make their information assets AI-ready without compromising security or breaking the bank on storage costs. The solution is available immediately through the Nvidia API catalog with a 90-day free trial license.

The announcement underscores the growing focus on enterprise AI infrastructure as companies move beyond experimentation to large-scale deployment, with data management and cost efficiency becoming critical success factors.

