Data Center News
Power & Cooling

Unifying AI management: Datadog launches GPU Monitoring

Last updated: April 22, 2026 7:50 pm
Published April 22, 2026

Datadog has launched GPU Monitoring, now available to customers globally. The product is designed to address the challenges organisations face in managing rising AI-related costs.

“GPU instances account for 14 percent of compute costs, which is a big challenge as companies are struggling to build AI-first technology in scalable and sensible ways. While these companies can see their costs climbing, they can’t charge back GPU spend across business units, see workload context or identify clear next steps for improvement. As a result, it is very challenging to budget and plan in thoughtful ways,” said Yanbing Li, Chief Product Officer at Datadog.

The launch comes as companies seek more effective ways to manage GPU spending linked to AI workloads. Many organisations have difficulty allocating GPU costs across business units, and limited workload context can make budgeting and planning more complex.
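
As an illustration of the cost-allocation problem described above, the following is a minimal sketch of one common chargeback approach: splitting a shared GPU bill across business units in proportion to the GPU-hours each consumed. It is not Datadog's method or API. The function name, the record format, and the sample figures are all hypothetical.

```python
from collections import defaultdict


def allocate_gpu_cost(usage_records, total_cost):
    """Split total_cost across business units in proportion to GPU-hours.

    usage_records is an iterable of (business_unit, gpu_hours) pairs.
    This record format is an assumption made for the example.
    """
    hours_by_unit = defaultdict(float)
    for unit, gpu_hours in usage_records:
        hours_by_unit[unit] += gpu_hours

    total_hours = sum(hours_by_unit.values())
    if total_hours == 0:
        # No recorded usage: nothing can be attributed to any unit.
        return {unit: 0.0 for unit in hours_by_unit}

    return {
        unit: total_cost * hours / total_hours
        for unit, hours in hours_by_unit.items()
    }


# Hypothetical usage data: two business units sharing one GPU bill.
records = [("search", 120.0), ("ads", 60.0), ("search", 20.0)]
shares = allocate_gpu_cost(records, total_cost=1000.0)
```

Proportional allocation by GPU-hours is only one possible policy. It presumes that usage can already be attributed to a business unit, which is the workload context the article says many organisations lack.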

GPU Monitoring aims to provide a unified view across AI infrastructure, linking GPU fleet health, cost, and performance to the teams using these resources. This supports faster troubleshooting of slow workloads and is intended to improve cost visibility.

As AI deployments scale, managing compute resources increasingly involves broader organisational planning, particularly where capacity is misallocated or where training and inference workloads are affected by cost or performance constraints. Many organisations currently work with fragmented visibility into GPU usage. GPU Monitoring is intended to consolidate this view.

Existing GPU monitoring tools often provide basic hardware health metrics but may not show cross-team resource contention, explain why workloads failed, or identify underused devices. This can slow investigations and lead to overprovisioning as a precaution, contributing to higher resource usage.


By connecting GPU fleet telemetry with workload data, GPU Monitoring provides a shared view for platform engineering and machine learning teams.

  • Scale AI without overspending: Utilisation insights help guide capacity planning, support decisions on new GPU purchases versus reallocation, and improve cost predictability.
  • Accelerate AI delivery: Linking performance issues to specific GPUs and processes helps identify bottlenecks more quickly.
  • Avoid costly disruptions: Early detection of unhealthy GPUs can help reduce the risk of broader system failures.
  • Maximise ROI on GPU spend: Visibility into utilisation allows teams to identify underused or overprovisioned resources and adjust allocation accordingly.
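
To make the last point concrete, here is a minimal sketch of how underused GPUs might be flagged from utilisation samples. It does not use or represent Datadog's product. The device IDs, the sample values, and the 20 percent threshold are assumptions chosen for the example.

```python
from statistics import mean


def find_underused_gpus(samples, threshold_pct=20.0):
    """Return sorted IDs of GPUs whose mean utilisation is below threshold_pct.

    samples maps a GPU ID to a list of utilisation percentages (0-100).
    GPUs with no samples are skipped rather than flagged.
    """
    return sorted(
        gpu_id
        for gpu_id, values in samples.items()
        if values and mean(values) < threshold_pct
    )


# Hypothetical utilisation samples for three devices.
samples = {
    "gpu-0": [85.0, 90.0, 78.0],
    "gpu-1": [5.0, 12.0, 8.0],
    "gpu-2": [0.0, 0.0, 3.0],
}
underused = find_underused_gpus(samples)
```

A mean over a fixed window is a deliberately simple criterion. In practice a team would likely also consider memory usage, time of day, and whether a device is reserved for a scheduled job before reallocating it.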

Overall, GPU Monitoring is positioned as a tool to improve visibility and resource management for AI workloads across organisations.



© 2026 Data Center News. All Rights Reserved.
