Data Center News

Ant Group uses domestic chips to train AI models and cut costs

Last updated: April 3, 2025 8:04 pm
Published April 3, 2025

Ant Group is relying on Chinese-made semiconductors to train artificial intelligence models, aiming to reduce costs and lessen dependence on restricted US technology, according to people familiar with the matter.

The Alibaba-owned company has used chips from domestic suppliers, including those tied to its parent, Alibaba, and Huawei Technologies, to train large language models using the Mixture of Experts (MoE) method. The results were reportedly comparable to those produced with Nvidia’s H800 chips, sources claim. While Ant continues to use Nvidia chips for some of its AI development, one source said the company is turning increasingly to alternatives from AMD and Chinese chip-makers for its latest models.

The development signals Ant’s deeper involvement in the growing AI race between Chinese and US tech firms, particularly as companies look for cost-effective ways to train models. The experimentation with domestic hardware reflects a broader effort among Chinese firms to work around export restrictions that block access to high-end chips like Nvidia’s H800, which, although not the most advanced, is still one of the more powerful GPUs available to Chinese organisations.

Ant has published a research paper describing its work, stating that its models, in some tests, performed better than those developed by Meta. Bloomberg News, which initially reported the matter, has not verified the company’s results independently. If the models perform as claimed, Ant’s efforts may represent a step forward in China’s attempt to lower the cost of running AI applications and reduce its reliance on foreign hardware.

MoE models divide tasks into smaller data sets handled by separate components, and have gained attention among AI researchers and data scientists. The technique has been used by Google and the Hangzhou-based startup, DeepSeek. The MoE concept is similar to having a team of specialists, each handling part of a task to make the process of producing models more efficient. Ant has declined to comment on its work with respect to its hardware sources.
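
The specialist analogy can be made concrete with a minimal sketch of top-k expert routing in plain Python. This is an illustration only, not the implementation used by Ant, Google, or DeepSeek: the `route` helper, the toy expert functions, and the gate scores are all hypothetical, and in a real model both the experts and the gate are learned neural networks.

```python
import math

def softmax(scores):
    """Convert raw gate scores into weights that sum to 1."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def route(x, gate_scores, experts, top_k=2):
    """Send input x to the top_k highest-scoring experts only.

    Returns the weighted output and the indices of the experts used.
    """
    weights = softmax(gate_scores)
    # Pick the k experts the gate scores highest; the rest are never called.
    chosen = sorted(range(len(experts)), key=lambda i: weights[i], reverse=True)[:top_k]
    # Renormalise so the chosen experts' weights sum to 1.
    norm = sum(weights[i] for i in chosen)
    output = sum(weights[i] / norm * experts[i](x) for i in chosen)
    return output, chosen

# Hypothetical "experts": stand-ins for neural sub-networks.
experts = [lambda x: x + 1, lambda x: x * 2, lambda x: x ** 2, lambda x: -x]

# Hypothetical gate scores; a real gate computes these from the input.
output, used = route(3.0, gate_scores=[0.1, 2.0, 2.0, -1.0], experts=experts)
print(used, output)
```

Only two of the four experts run for this input. That selective activation is the general reason MoE designs can grow total parameter count while keeping the work done per token roughly constant.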

Training MoE models depends on high-performance GPUs, which can be too expensive for smaller companies to acquire or use. Ant’s research focused on reducing that cost barrier. The paper’s title is suffixed with a clear objective: Scaling Models “without premium GPUs.” [our quotation marks]

The direction taken by Ant and the use of MoE to reduce training costs contrast with Nvidia’s approach. CEO Jensen Huang has said that demand for computing power will continue to grow, even with the introduction of more efficient models like DeepSeek’s R1. His view is that companies will seek more powerful chips to drive revenue growth, rather than aiming to cut costs with cheaper alternatives. Nvidia’s strategy remains focused on building GPUs with more cores, transistors, and memory.

According to the Ant Group paper, training one trillion tokens – the basic units of data AI models use to learn – cost about 6.35 million yuan (roughly $880,000) using conventional high-performance hardware. The company’s optimised training method reduced that cost to around 5.1 million yuan by using lower-specification chips.
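
A quick back-of-the-envelope check puts those figures in perspective: the reported saving is about 1.25 million yuan, or roughly 20 percent. The short Python sketch below works only from the numbers quoted above. The exchange rate is inferred from the article's own yuan-to-dollar conversion and is an assumption, not an official rate.

```python
TOKENS = 1_000_000_000_000   # one trillion tokens
BASELINE_YUAN = 6_350_000    # conventional high-performance hardware
OPTIMISED_YUAN = 5_100_000   # lower-specification chips
BASELINE_USD = 880_000       # the article's rough dollar figure for the baseline

saving_yuan = BASELINE_YUAN - OPTIMISED_YUAN
saving_pct = 100 * saving_yuan / BASELINE_YUAN

# Assumption: reuse the exchange rate implied by the article's conversion.
yuan_per_usd = BASELINE_YUAN / BASELINE_USD
optimised_usd = OPTIMISED_YUAN / yuan_per_usd

print(f"Saving: {saving_yuan:,} yuan ({saving_pct:.1f}%)")
print(f"Implied rate: {yuan_per_usd:.2f} yuan per USD")
print(f"Optimised cost: about ${optimised_usd:,.0f}")
print(f"Baseline cost per million tokens: {BASELINE_YUAN / TOKENS * 1e6:.2f} yuan")
```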

Ant said it plans to apply its models produced in this way – Ling-Plus and Ling-Lite – to industrial AI use cases like healthcare and finance. Earlier this year, the company acquired Haodf.com, a Chinese online medical platform, to further Ant’s ambition to deploy AI-based solutions in healthcare. It also operates other AI services, including a virtual assistant app called Zhixiaobao and a financial advisory platform known as Maxiaocai.

“If you find one point of attack to beat the world’s best kung fu master, you can still say you beat them, which is why real-world application is important,” said Robin Yu, chief technology officer of Beijing-based AI firm, Shengshang Tech.

Ant has made its models open source. Ling-Lite has 16.8 billion parameters – settings that help determine how a model functions – while Ling-Plus has 290 billion. For comparison, estimates suggest closed-source GPT-4.5 has around 1.8 trillion parameters, according to MIT Technology Review.

Despite progress, Ant’s paper noted that training models remains challenging. Small adjustments to hardware or model structure during model training sometimes resulted in unstable performance, including spikes in error rates.
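
As a rough illustration of what spotting such spikes can look like in practice, the sketch below flags any training step whose loss jumps well above the recent average. It is a generic monitoring idea, not a method taken from Ant's paper: the `find_spikes` helper, the window size, the threshold ratio, and the loss values are all made up for the example.

```python
from collections import deque

def find_spikes(losses, window=5, ratio=1.5):
    """Return step indices where loss exceeds `ratio` times the recent average."""
    recent = deque(maxlen=window)
    spikes = []
    for step, loss in enumerate(losses):
        if len(recent) == window and loss > ratio * (sum(recent) / window):
            spikes.append(step)
        else:
            # Only non-spike values update the baseline, so one spike
            # does not inflate the average used to judge later steps.
            recent.append(loss)
    return spikes

# Made-up loss curve with a single spike at step 7.
losses = [2.0, 1.9, 1.8, 1.7, 1.6, 1.5, 1.4, 3.5, 1.3, 1.2]
print(find_spikes(losses))
```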

(Photograph by Unsplash)
