
F5 and NVIDIA expand collaboration on AI infrastructure

Last updated: March 25, 2026 2:23 pm
Published March 25, 2026

F5, a provider of application and API delivery and security solutions, has announced expanded capabilities in collaboration with NVIDIA to improve AI inference infrastructure. The collaboration integrates F5 BIG-IP Next for Kubernetes with NVIDIA BlueField-3 DPUs, creating a telemetry-aware infrastructure layer. The integration is designed to increase token throughput through improved GPU utilisation, reduce latency, and support secure multi-tenant AI platforms at scale.

In AI systems, tokens are measurable units of AI output, such as words or data fragments generated during inference. The rate at which these tokens are produced affects user experience, infrastructure efficiency, and revenue per accelerator. As businesses and GPU-as-a-Service (GPUaaS) providers adopt AI, infrastructure efficiency is a critical consideration. The solution from F5 and NVIDIA aims to address these factors, including token throughput and cost per token.
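To make the link between token throughput and cost per token concrete, the arithmetic can be sketched in a few lines of Python. The function names, the hourly price, and the throughput figures below are illustrative assumptions; none of them come from F5, NVIDIA, or the tests described in this article.

```python
def tokens_per_second(tokens_generated: int, elapsed_seconds: float) -> float:
    """Token throughput: tokens produced divided by wall-clock time."""
    if elapsed_seconds <= 0:
        raise ValueError("elapsed_seconds must be positive")
    return tokens_generated / elapsed_seconds


def cost_per_million_tokens(gpu_cost_per_hour: float, throughput_tps: float) -> float:
    """Accelerator cost attributed to one million generated tokens."""
    if throughput_tps <= 0:
        raise ValueError("throughput_tps must be positive")
    tokens_per_hour = throughput_tps * 3600
    return gpu_cost_per_hour / tokens_per_hour * 1_000_000


# Hypothetical figures: one accelerator at 2.00 per hour (assumed price).
baseline = cost_per_million_tokens(gpu_cost_per_hour=2.00, throughput_tps=1000)
improved = cost_per_million_tokens(gpu_cost_per_hour=2.00, throughput_tps=1250)
print(f"baseline: {baseline:.4f} per million tokens")
print(f"improved: {improved:.4f} per million tokens")
```

Under these assumed numbers, raising throughput by 25% on the same hardware lowers the cost per million tokens by 20%, which is why throughput and cost per token are discussed together.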

The shift from application-centric to agent-driven AI workflows requires architectural approaches that improve token throughput and reduce costs. BIG-IP Next for Kubernetes now uses NVIDIA NIM statistics and GPU telemetry to make routing decisions for inference requests. This matches workloads with appropriate accelerators in real time, aiming to improve utilisation and reduce latency.
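The general idea of telemetry-aware routing can be illustrated with a minimal Python sketch. This is not F5's algorithm or API: the `BackendTelemetry` fields, the `load_score` formula, and the `queue_weight` value are all hypothetical, chosen only to show how a router might prefer the least-loaded accelerator.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class BackendTelemetry:
    """Hypothetical per-backend snapshot; field names are assumptions."""
    name: str
    gpu_utilisation: float  # fraction of GPU in use, 0.0 to 1.0
    queued_requests: int    # inference requests waiting on this backend


def load_score(backend: BackendTelemetry, queue_weight: float = 0.05) -> float:
    """Lower is better. The weighting is an arbitrary illustrative choice."""
    return backend.gpu_utilisation + queue_weight * backend.queued_requests


def pick_backend(backends: list[BackendTelemetry]) -> BackendTelemetry:
    """Route the next request to the backend with the lowest load score."""
    if not backends:
        raise ValueError("no backends available")
    return min(backends, key=load_score)


snapshot = [
    BackendTelemetry("gpu-a", gpu_utilisation=0.9, queued_requests=4),
    BackendTelemetry("gpu-b", gpu_utilisation=0.4, queued_requests=2),
    BackendTelemetry("gpu-c", gpu_utilisation=0.3, queued_requests=8),
]
print(pick_backend(snapshot).name)
```

In this sketch `gpu-c` has the lowest utilisation but a long queue, so the request goes to `gpu-b` instead. A round-robin balancer would not see either signal, which is the distinction the article draws between traditional and telemetry-aware load balancing.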

Tests validated by The Tolly Group demonstrated increased token throughput, faster time to first token (TTFT), and reduced request latency. Offloading functions such as networking and AI-aware load balancing to NVIDIA BlueField-3 DPUs preserves host CPU capacity, enabling GPUs to perform high-throughput inference. This increases token yield and reduces costs without requiring changes to AI models.

AI applications require traffic control beyond traditional load balancing. BIG-IP Next for Kubernetes now supports inference-aware routing for agent-driven AI tasks. Integration with the NVIDIA DOCA Platform Framework facilitates deployment and management of NVIDIA BlueField DPUs. These capabilities aim to allow organisations to share GPU infrastructure securely across business units or customers while maintaining performance and service predictability.

The collaboration between F5 and NVIDIA aims to provide tools to monitor token consumption, improve traffic flow, and optimise infrastructure utilisation. This approach seeks to allow organisations to achieve greater efficiency from GPUs and better align resources with AI workloads.

By combining NVIDIA infrastructure telemetry and DPU acceleration with F5 operational intelligence, enterprises can adapt AI infrastructure for more efficient, multi-tenant, and agent-driven workloads.


