Data Center News

F5 and NVIDIA expand collaboration on AI infrastructure

Last updated: March 25, 2026 2:23 pm
Published March 25, 2026

F5, a provider of application and API delivery and security solutions, has introduced expanded capabilities in collaboration with NVIDIA to boost AI inference infrastructure. The collaboration integrates F5 BIG-IP Next for Kubernetes with NVIDIA BlueField-3 DPUs, creating a telemetry-aware infrastructure layer. The combination is designed to increase token throughput through improved GPU utilisation, reduce latency, and support secure multi-tenant AI platforms at scale.

In AI systems, tokens are measurable units of AI output, such as words or data fragments generated during inference. The rate at which these tokens are produced affects user experience, infrastructure efficiency, and revenue per accelerator. As businesses and GPU-as-a-Service (GPUaaS) providers adopt AI, infrastructure efficiency is a critical consideration. The solution from F5 and NVIDIA aims to address these factors, including token throughput and cost per token.
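To make the link between token throughput and cost per token concrete, here is a minimal Python sketch of the arithmetic. The function names, the hourly price, and the throughput figures are all hypothetical and chosen purely for illustration; none of them come from F5, NVIDIA, or the Tolly tests.

```python
def tokens_per_second(tokens_generated: int, elapsed_seconds: float) -> float:
    """Token throughput: tokens produced per second of wall-clock time."""
    if elapsed_seconds <= 0:
        raise ValueError("elapsed_seconds must be positive")
    return tokens_generated / elapsed_seconds


def cost_per_million_tokens(hourly_cost: float, throughput_tps: float) -> float:
    """Cost of producing one million tokens on an accelerator billed per hour."""
    if throughput_tps <= 0:
        raise ValueError("throughput_tps must be positive")
    tokens_per_hour = throughput_tps * 3600
    return hourly_cost / tokens_per_hour * 1_000_000


# Hypothetical numbers: same hourly price, 25% more tokens per second.
baseline = cost_per_million_tokens(hourly_cost=4.0, throughput_tps=1000.0)
improved = cost_per_million_tokens(hourly_cost=4.0, throughput_tps=1250.0)

print(f"baseline: {baseline:.4f} per million tokens")
print(f"improved: {improved:.4f} per million tokens")
```

Because the hourly price is fixed, cost per token falls in inverse proportion to throughput, which is why better GPU utilisation shows up directly in this metric.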

The shift from application-centric to agent-driven AI workflows requires architectural approaches that improve token throughput and reduce costs. BIG-IP Next for Kubernetes now uses NVIDIA NIM statistics and GPU telemetry to make routing decisions for inference requests. This matches workloads with appropriate accelerators in real time, aiming to improve utilisation and reduce latency.
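The general idea of telemetry-aware routing can be sketched in a few lines of Python. This is not F5's algorithm and does not use any NVIDIA NIM or BIG-IP API; the `BackendTelemetry` fields, the `load_score` formula, and the backend names are assumptions made up for this example.

```python
from dataclasses import dataclass


@dataclass
class BackendTelemetry:
    """Hypothetical per-backend snapshot; field names are illustrative."""
    name: str
    gpu_utilisation: float   # fraction between 0.0 and 1.0
    queued_requests: int     # requests waiting on this backend
    healthy: bool = True


def load_score(t: BackendTelemetry, queue_weight: float = 0.1) -> float:
    """Lower is better: GPU utilisation plus a penalty per queued request."""
    return t.gpu_utilisation + queue_weight * t.queued_requests


def pick_backend(backends: list[BackendTelemetry]) -> BackendTelemetry:
    """Route to the healthy backend with the lowest load score."""
    candidates = [b for b in backends if b.healthy]
    if not candidates:
        raise RuntimeError("no healthy inference backend available")
    return min(candidates, key=load_score)


# Hypothetical fleet: gpu-c is the least loaded but is marked unhealthy.
fleet = [
    BackendTelemetry("gpu-a", gpu_utilisation=0.92, queued_requests=4),
    BackendTelemetry("gpu-b", gpu_utilisation=0.55, queued_requests=1),
    BackendTelemetry("gpu-c", gpu_utilisation=0.30, queued_requests=0, healthy=False),
]
chosen = pick_backend(fleet)
print(chosen.name)
```

A production router would weigh more signals and refresh them continuously; the sketch only shows how a live telemetry snapshot, rather than a fixed round-robin order, can drive the choice of accelerator.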

Tests validated by The Tolly Group demonstrated increased token throughput, faster time to first token (TTFT), and reduced request latency. Offloading functions such as networking and AI-aware load balancing to NVIDIA BlueField-3 DPUs preserves host CPU capacity, enabling GPUs to perform high-throughput inference. This increases token yield and reduces costs without requiring changes to AI models.
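For readers unfamiliar with the metrics named above, the following Python sketch shows one common way to derive TTFT, request latency, and token throughput from per-token arrival timestamps. The function name and the timestamps are hypothetical; they are not taken from the Tolly report, whose methodology is not described here.

```python
def inference_metrics(request_sent: float, token_times: list[float]) -> dict[str, float]:
    """Derive basic latency metrics from token arrival timestamps (seconds)."""
    if not token_times:
        raise ValueError("at least one token timestamp is required")
    ttft = token_times[0] - request_sent        # time to first token
    latency = token_times[-1] - request_sent    # full request latency
    throughput = len(token_times) / latency     # tokens per second overall
    return {"ttft": ttft, "latency": latency, "tokens_per_second": throughput}


# Hypothetical stream: request sent at t=0.0, five tokens arrive afterwards.
metrics = inference_metrics(0.0, [0.25, 0.30, 0.35, 0.40, 0.50])
for name, value in metrics.items():
    print(f"{name}: {value:.3f}")
```

TTFT captures how long a user waits before any output appears, while request latency and tokens per second describe the whole response, so the three figures can move independently.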

AI functions require visitors management past conventional load balancing. BIG-IP Subsequent for Kubernetes now helps inference-aware routing for agent-driven AI duties. Integration with the NVIDIA DOCA Platform Framework facilitates deployment and administration of NVIDIA BlueField DPUs. These capabilities goal to permit organisations to share GPU infrastructure securely throughout items or purchasers whereas sustaining efficiency and repair predictability.

The collaboration between F5 and NVIDIA aims to provide tools to monitor token consumption, improve traffic flow, and optimise infrastructure utilisation. This approach seeks to allow organisations to achieve better efficiency from GPUs and better align resources with AI workloads.

By combining NVIDIA infrastructure telemetry and DPU acceleration with F5 operational intelligence, enterprises can adapt AI infrastructure for more efficient, multi-tenant, and agent-driven workloads.


