Monday, 13 Apr 2026
Subscribe
logo
  • Global
  • AI
  • Cloud Computing
  • Edge Computing
  • Security
  • Investment
  • Sustainability
  • More
    • Colocation
    • Quantum Computing
    • Regulation & Policy
    • Infrastructure
    • Power & Cooling
    • Design
    • Innovations
    • Blog
Font ResizerAa
Data Center NewsData Center News
Search
  • Global
  • AI
  • Cloud Computing
  • Edge Computing
  • Security
  • Investment
  • Sustainability
  • More
    • Colocation
    • Quantum Computing
    • Regulation & Policy
    • Infrastructure
    • Power & Cooling
    • Design
    • Innovations
    • Blog
Have an existing account? Sign In
Follow US
© 2022 Foxiz News Network. Ruby Design Company. All Rights Reserved.
Data Center News > Blog > Cloud Computing > How cloud providers are tackling GPU shortages with custom chips
Cloud Computing

How cloud providers are tackling GPU shortages with custom chips

Last updated: December 6, 2024 7:40 pm
Published December 6, 2024
Share
How cloud providers are tackling GPU shortages with custom chips
SHARE

GPUs are the spine of AI computing, however as demand exceeds provide, cloud suppliers are getting artistic.

As a substitute of ready for extra GPUs, as Network World reported, they’re creating customized chips to fulfill particular workloads, delivering quicker, extra environment friendly computing whereas maintaining prices below management.

The competitors is heating up. At Microsoft’s Ignite convention final week, the corporate unveiled two new chips designed to spice up efficiency for its Azure platform. All eyes are actually on AWS, because it gears up for its personal, customized silicon portfolio.

Why customized chips matter

GPUs have revolutionised duties like coaching AI fashions, however they’re not at all times one of the best instrument for the job. They arrive with vital drawbacks: excessive energy consumption, intensive cooling wants, and, proper now, a worldwide scarcity. Nvidia’s newest GPUs stock is spoken for, for the following 12 months.

Customized accelerators are stepping in to fill the hole. Mario Morales, vp analyst at IDC, highlights the rising significance of options to GPUs: “These accelerators have gotten more and more necessary in cloud infrastructure attributable to their superior price-performance and price-efficiency ratios, which result in higher return on investments.”

AWS and Google have been rolling out customized chips for years—AWS with Trainium and Inferentia, and Google with Tensor Processing Items (TPUs). Microsoft, nonetheless, was late to hitch the customized silicon pattern. It wasn’t till final 12 months that the corporate launched its first customized chips, Maia and Cobalt, geared toward bettering power effectivity and dealing with AI workloads.

See also  Cloud Transformation Conference Global is opening its doors! 

This 12 months, Microsoft has stepped up its sport, introducing two new chips:

  • Azure Increase DPU: Designed to optimise information processing by operating a customized working system.
  • Azure Built-in HSM: Centered on safety, it retains encryption and signing keys securely in {hardware}.

Microsoft’s Azure Increase DPU is a step ahead, nevertheless it nonetheless lags behind opponents within the DPU area. Forrester senior analyst Alvin Nguyen notes that Google’s E2000 IPU, co-developed with Intel, and AWS’s Nitro system are each already well-established. Different cloud suppliers, together with Nvidia with its Bluefield chips and AMD with Pensando, are jockeying for place.

That stated, Microsoft is making notable developments in infrastructure. The corporate introduced new liquid-cooling options for AI servers and a power-efficient rack design co-developed with Meta, which may pack 35% extra AI accelerators into every rack.

Safety will get a customized enhance

Safety is one other space the place customized silicon is making progress. Microsoft’s new HSM chip is a devoted resolution for encryption duties that will historically require a mixture of {hardware} and software program. Nguyen notes this strategy reduces latency and enhances scalability, making it an addition value contemplating.

AWS and Google are additionally utilizing customized chips for safety. AWS Nitro prevents major system CPUs from modifying firmware, and Google’s Titan establishes ‘a safe root of belief’ for validating system well being.

Every supplier has its personal strategy, Nguyen explains. “Whereas Nitro gives the important safety operate of making certain that the primary system CPUs can not replace firmware in naked steel mode, Titan gives a hardware-based root of belief that establishes the robust id of a machine, with which we will make necessary safety selections and validate the well being of the system.”

See also  Qualcomm Unveils Data Center CPUs Compatible with NVIDIA Chips

The way forward for customized chips within the cloud

The push for customized silicon isn’t slowing. Based on Alexander Harrowell, principal analyst at Omdia, it’s a logical transfer for hyperscalers to spend money on these chips to cut back prices and enhance effectivity.

Because the demand for quicker, extra specialised computing grows, customized chips are a technique for cloud suppliers to remain aggressive. With innovation in overdrive, the race to redefine cloud efficiency is simply beginning.

(Picture by Unsplash)

See additionally: IBM needs Nvidia GPUs, and AWS may be the reply

Need to be taught extra about cybersecurity and the cloud from trade leaders? Try Cyber Security & Cloud Expo happening in Amsterdam, California, and London. Discover different upcoming enterprise know-how occasions and webinars powered by TechForge here.

Tags: AI, cloud, microsoft

Source link

TAGGED: Chips, cloud, custom, GPU, Providers, shortages, Tackling
Share This Article
Twitter Email Copy Link Print
Previous Article Meta launches Llama 3.3, shrinking powerful 405B open model Meta launches Llama 3.3, shrinking powerful 405B open model
Next Article Pepeto and Pepe Unchained Compete for Dominance in the Next Memecoin Era Pepeto and Pepe Unchained Compete for Dominance in the Next Memecoin Era
Leave a comment

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Your Trusted Source for Accurate and Timely Updates!

Our commitment to accuracy, impartiality, and delivering breaking news as it happens has earned us the trust of a vast audience. Stay ahead with real-time updates on the latest events, trends.
FacebookLike
TwitterFollow
InstagramFollow
YoutubeSubscribe
LinkedInFollow
MediumFollow
- Advertisement -
Ad image

Popular Posts

Aramco Digital and Intel to establish Saudi Arabia’s first Open RAN development centre

Aramco Digital and Intel plan to establish Saudi Arabia’s inaugural Open RAN (Radio Access Network)…

January 22, 2024

Biden administration invests $269M to boost edge computing capabilities in microelectronics sector

The Biden-Harris Administration introduced $269 million in funding for microelectronics manufacturing and workforce growth. This…

September 25, 2024

Strong Q2 Demand for IT and Business Services in Americas, Says ISG

The newest quarterly report from Data Providers Group (ISG), a worldwide expertise analysis and advisory…

July 15, 2025

Next-generation secure, defined internet with SCION architecture

The web was constructed in additional easy, harmless instances and was seized on by a…

September 27, 2024

Swedish cloud and AI infrastructure to see Microsoft investment | IceNews

Microsoft has introduced that it'll make investments 33.7 billion Swedish Krona over two years to…

June 9, 2024

You Might Also Like

Red Hat expands collaboration with Google Cloud to strengthen application modernisation
Design

Red Hat expands collaboration with Google Cloud to strengthen application modernisation

By saad
Netzwerken, Karriereplanung
Global Market

Intel secures Google cloud and AI infrastructure deal

By saad
ControlMonkey expands cloud configuration disaster recovery for improved resilience
Infrastructure

ControlMonkey expands cloud configuration disaster recovery for improved resilience

By saad
CoreWeave secures AI cloud capacity deal with Meta through 2032
Design

CoreWeave secures AI cloud capacity deal with Meta through 2032

By saad
Data Center News
Facebook Twitter Youtube Instagram Linkedin

About US

Data Center News: Stay informed on the pulse of data centers. Latest updates, tech trends, and industry insights—all in one place. Elevate your data infrastructure knowledge.

Top Categories
  • Global Market
  • Infrastructure
  • Innovations
  • Investments
Usefull Links
  • Home
  • Contact
  • Privacy Policy
  • Terms & Conditions

© 2024 – datacenternews.tech – All rights reserved

Welcome Back!

Sign in to your account

Lost your password?
We use cookies to ensure that we give you the best experience on our website. If you continue to use this site we will assume that you are happy with it.
You can revoke your consent any time using the Revoke consent button.