Friday, 10 Apr 2026
Subscribe
logo
  • Global
  • AI
  • Cloud Computing
  • Edge Computing
  • Security
  • Investment
  • Sustainability
  • More
    • Colocation
    • Quantum Computing
    • Regulation & Policy
    • Infrastructure
    • Power & Cooling
    • Design
    • Innovations
    • Blog
Font ResizerAa
Data Center NewsData Center News
Search
  • Global
  • AI
  • Cloud Computing
  • Edge Computing
  • Security
  • Investment
  • Sustainability
  • More
    • Colocation
    • Quantum Computing
    • Regulation & Policy
    • Infrastructure
    • Power & Cooling
    • Design
    • Innovations
    • Blog
Have an existing account? Sign In
Follow US
© 2022 Foxiz News Network. Ruby Design Company. All Rights Reserved.
Data Center News > Blog > Edge Computing > Rafay unveils serverless inference to power AI-as-a-Service for GPU cloud providers
Edge Computing

Rafay unveils serverless inference to power AI-as-a-Service for GPU cloud providers

Last updated: May 21, 2025 2:30 pm
Published May 21, 2025
Share
Rafay unveils serverless inference to power AI-as-a-Service for GPU cloud providers
SHARE

Rafay launched a Serverless Inference providing to assist NVIDIA Cloud Companions (NCPs) and GPU Cloud Suppliers ship high-margin AI providers shortly and cost-effectively. 

The providing supplies a token-metered API for operating open-source and privately skilled/tuned giant language fashions (LLMs). Key options embody seamless developer integration, clever infrastructure administration, built-in metering and billing, enterprise-grade safety, and observability instruments. 

It permits NCPs and GPU Clouds to transition from GPU-as-a-Service to AI-as-a-Service, addressing the rising demand within the AI inference market. The answer eliminates infrastructure complexity, permitting builders and enterprises to combine generative AI workflows into functions quickly. 

“Having spent the final 12 months experimenting with GenAI, many enterprises at the moment are centered on constructing agentic AI functions that increase and improve their enterprise choices,” says Haseeb Budhani, CEO and co-founder of Rafay Programs. “The flexibility to quickly eat GenAI fashions by way of inference endpoints is vital to sooner growth of GenAI capabilities. That is the place Rafay’s NCP and GPU Cloud companions have a fabric benefit.” 

This answer represents a shift in the direction of extra dynamic, scalable AI workloads that may function nearer to information sources, lowering latency and enhancing real-time processing. Moreover, it may speed up the adoption of edge-based machine studying functions throughout industries, driving development in edge AI inference markets.

The global AI inference market is projected to develop considerably, reaching $106 billion by 2025 and $254 billion by 2030. 

Rafay’s platform helps multi-tenant GPU/CPU infrastructure and can quickly embody fine-tuning capabilities for AI fashions. Rafay goals to simplify cloud-native and AI infrastructure administration, with prospects akin to MoneyGram and Guardant Well being leveraging its options.

See also  Rockwell pushes data to the edge with launch of OptixEdge gateway

Associated

AI/ML  |  cloud infrastructure  |  generative AI  |  GPU cloud  |  serverless inference

Source link

TAGGED: AIasaService, cloud, GPU, Inference, Power, Providers, Rafay, serverless, unveils
Share This Article
Twitter Email Copy Link Print
Previous Article Zoca Founders Zoca Raises $6M in Funding
Next Article mains inline connector Which mains inline connector should you use? A practical guide
Leave a comment

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Your Trusted Source for Accurate and Timely Updates!

Our commitment to accuracy, impartiality, and delivering breaking news as it happens has earned us the trust of a vast audience. Stay ahead with real-time updates on the latest events, trends.
FacebookLike
TwitterFollow
InstagramFollow
YoutubeSubscribe
LinkedInFollow
MediumFollow
- Advertisement -
Ad image

Popular Posts

From prompt chaos to clarity: How to build a robust AI orchestration layer

Be part of the occasion trusted by enterprise leaders for practically 20 years. VB Remodel…

June 18, 2025

Veracode Acquires Phylum

Veracode, a Burlington, MA-based firm which makes a speciality of utility threat administration, acquired the…

January 6, 2025

Why Infrastructure Matters in the Race for Adoption

AI is prepared for companies, however are companies prepared for AI? That’s one of many…

January 22, 2025

Microsoft is building new Windows security features to prevent another CrowdStrike incident

Microsoft is asserting plans to make adjustments to Home windows that may assist CrowdStrike and…

September 13, 2024

Security first for Schneider Electric’s EcoStruxure IT DCIM solutions

Schneider Electrical has introduced what it believes to be an business first: its EcoStruxure IT…

October 7, 2024

You Might Also Like

Premio targets multi-camera edge AI with new Jetson Orin systems
Edge Computing

Premio targets multi-camera edge AI with new Jetson Orin systems

By saad
Hosted.ai raises $19M to tackle GPU underutilization and reshape AI infrastructure economics
Edge Computing

Hosted.ai raises $19M to tackle GPU underutilization and reshape AI infrastructure economics

By saad
NVIDIA and T-Mobile push AI-RAN to turn 5G networks into distributed edge compute platforms
Edge Computing

NVIDIA and T-Mobile push AI-RAN to turn 5G networks into distributed edge compute platforms

By saad
Nscale moves into power with AIPCorp deal, building 8GW U.S. AI campus to bypass energy bottlenecks
Edge Computing

Nscale moves into power with AIPCorp deal, building 8GW U.S. AI campus to bypass energy bottlenecks

By saad
Data Center News
Facebook Twitter Youtube Instagram Linkedin

About US

Data Center News: Stay informed on the pulse of data centers. Latest updates, tech trends, and industry insights—all in one place. Elevate your data infrastructure knowledge.

Top Categories
  • Global Market
  • Infrastructure
  • Innovations
  • Investments
Usefull Links
  • Home
  • Contact
  • Privacy Policy
  • Terms & Conditions

© 2024 – datacenternews.tech – All rights reserved

Welcome Back!

Sign in to your account

Lost your password?
We use cookies to ensure that we give you the best experience on our website. If you continue to use this site we will assume that you are happy with it.
You can revoke your consent any time using the Revoke consent button.