Friday, 1 May 2026
Subscribe
logo
  • AI Compute
  • Infrastructure
  • Power & Cooling
  • Security
  • Colocation
  • Cloud Computing
  • More
    • Sustainability
    • Industry News
    • About Data Center News
    • Terms & Conditions
Font ResizerAa
Data Center NewsData Center News
Search
  • AI Compute
  • Infrastructure
  • Power & Cooling
  • Security
  • Colocation
  • Cloud Computing
  • More
    • Sustainability
    • Industry News
    • About Data Center News
    • Terms & Conditions
Have an existing account? Sign In
Follow US
© 2022 Foxiz News Network. Ruby Design Company. All Rights Reserved.
Data Center News > Blog > Power & Cooling > F5 and NVIDIA expand collaboration on AI infrastructure
Power & Cooling

F5 and NVIDIA expand collaboration on AI infrastructure

Last updated: March 25, 2026 2:23 pm
Published March 25, 2026
Share
F5 and NVIDIA expand collaboration on AI infrastructure
SHARE

F5, a supplier of software and API supply and safety options, has introduced expanded capabilities in collaboration with NVIDIA to boost AI inference infrastructures. This collaboration integrates F5 BIG-IP Subsequent for Kubernetes with NVIDIA BlueField-3 DPUs, making a telemetry-aware infrastructure layer. The combination is designed to extend token throughput by means of improved GPU utilisation, cut back latency, and assist safe multi-tenant AI platforms at scale.

In AI methods, tokens are measurable items of AI output, similar to phrases or knowledge fragments generated throughout inference. The manufacturing fee of those tokens impacts person expertise, infrastructure effectivity, and income per accelerator. As companies and GPU-as-a-Service (GPUaaS) suppliers undertake AI, infrastructure effectivity is a vital consideration. The answer from F5 and NVIDIA goals to handle these components, together with token throughput and value per token.

The shift from application-centric to agent-driven AI workflows requires architectural approaches that enhance token throughput and cut back prices. BIG-IP Subsequent for Kubernetes now makes use of NVIDIA NIM statistics and GPU telemetry to make routing choices for inferences. This matches workloads with applicable accelerators in actual time, aiming to enhance utilisation and cut back latency.

Assessments validated by The Tolly Group demonstrated elevated token throughput, sooner time to first token (TTFT), and lowered request latency. Offloading capabilities similar to networking and AI-aware load balancing to NVIDIA BlueField-3 DPUs permits host CPU capability to be preserved, enabling GPUs to carry out high-throughput inference. This will increase token yield and reduces prices with out requiring modifications to AI fashions.

AI functions require visitors management past conventional load balancing. BIG-IP Subsequent for Kubernetes now helps inference-aware routing for agent-driven AI duties. Integration with the NVIDIA DOCA Platform Framework facilitates deployment and administration of NVIDIA BlueField DPUs. These capabilities goal to permit organisations to share GPU infrastructure securely throughout items or purchasers whereas sustaining efficiency and repair predictability.

See also  Expanding access to digital infrastructure training

The collaboration between F5 and NVIDIA goals to supply instruments to observe token consumption, enhance visitors movement, and optimise infrastructure utilisation. This method seeks to permit organisations to attain higher effectivity from GPUs and higher align sources with AI workloads.

By combining NVIDIA infrastructure telemetry and DPU acceleration with F5 operational intelligence, enterprises can adapt AI infrastructures for extra environment friendly, multi-tenant, and agent-driven workloads.



Source link

TAGGED: collaboration, expand, infrastructure, Nvidia
Share This Article
Twitter Email Copy Link Print
Previous Article AI agents enter banking roles at Bank of America AI agents enter banking roles at Bank of America
Next Article Corning and US Conec to strengthen AI networks with PRIZM TMT technology Corning and US Conec to strengthen AI networks with PRIZM TMT technology
Leave a comment

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Your Trusted Source for Accurate and Timely Updates!

Our commitment to accuracy, impartiality, and delivering breaking news as it happens has earned us the trust of a vast audience. Stay ahead with real-time updates on the latest events, trends.
FacebookLike
TwitterFollow
InstagramFollow
YoutubeSubscribe
LinkedInFollow
MediumFollow
- Advertisement -
Ad image

Popular Posts

Navigating the EU AI Act: Implications for UK businesses

The EU AI Act, which got here into impact on August 1, 2024, marks a…

April 7, 2025

AI Trends and Predictions 2025 From Industry Insiders

This time final 12 months, AI — generative AI, particularly — was principally hype. Lots…

January 17, 2025

AI speech model cuts healthcare transcription errors

Deepgram has unveiled Nova-3 Medical, an AI speech-to-text (STT) mannequin tailor-made for transcription within the…

March 4, 2025

Web3 tech helps instil confidence and trust in AI

The promise of AI is that it’ll make all of our lives simpler. And with…

April 9, 2025

Korean Governor signs $35B AI deal, Alphabet chairman joins board

In an unprecedented transfer, Governor Kim Yung-Rok of South Korea's Jeollanam-do Province travelled to the…

February 27, 2025

You Might Also Like

Russelectric introduces advanced transfer switch systems for power transition management
Power & Cooling

Russelectric introduces advanced transfer switch systems for power transition management

By saad
STL launches Neuralis data centre connectivity suite in the U.S.
Power & Cooling

STL launches Neuralis data centre connectivity suite in the U.S.

By saad
BAC launches TrilliumSeries dry cooler for water-efficient cooling
Power & Cooling

BAC launches TrilliumSeries dry cooler for water-efficient cooling

By saad
Airsys enhances cooling solutions with the UniCool-Max
Power & Cooling

Airsys enhances cooling solutions with the UniCool-Max

By saad

About Us

Data Center News is your dedicated source for data center infrastructure, AI compute, cloud, and industry news.

Top Categories

  • AI & Compute
  • Cloud Computing
  • Power & Cooling
  • Colocation
  • Security
  • Infrastructure
  • Sustainability
  • Industry News

Useful Links

  • Home
  • Contact
  • Privacy Policy
  • Terms & Conditions

Find Us on Socials

© 2026 Data Center News. All Rights Reserved.

© 2026 Data Center News. All Rights Reserved.
Welcome Back!

Sign in to your account

Lost your password?
We use cookies to ensure that we give you the best experience on our website. If you continue to use this site we will assume that you are happy with it.
You can revoke your consent any time using the Revoke consent button.