Sunday, 8 Feb 2026
Subscribe
logo
  • Global
  • AI
  • Cloud Computing
  • Edge Computing
  • Security
  • Investment
  • Sustainability
  • More
    • Colocation
    • Quantum Computing
    • Regulation & Policy
    • Infrastructure
    • Power & Cooling
    • Design
    • Innovations
    • Blog
Font ResizerAa
Data Center NewsData Center News
Search
  • Global
  • AI
  • Cloud Computing
  • Edge Computing
  • Security
  • Investment
  • Sustainability
  • More
    • Colocation
    • Quantum Computing
    • Regulation & Policy
    • Infrastructure
    • Power & Cooling
    • Design
    • Innovations
    • Blog
Have an existing account? Sign In
Follow US
© 2022 Foxiz News Network. Ruby Design Company. All Rights Reserved.
Data Center News > Blog > AI > Ant Group uses domestic chips to train AI models and cut costs
AI

Ant Group uses domestic chips to train AI models and cut costs

Last updated: April 3, 2025 8:04 pm
Published April 3, 2025
Share
Ant Group uses domestic chips to train AI models and cut costs
SHARE

Ant Group is counting on Chinese language-made semiconductors to coach synthetic intelligence fashions to scale back prices and reduce dependence on restricted US know-how, in response to folks acquainted with the matter.

The Alibaba-owned firm has used chips from home suppliers, together with these tied to its mum or dad, Alibaba, and Huawei Applied sciences to coach massive language fashions utilizing the Combination of Consultants (MoE) methodology. The outcomes had been reportedly corresponding to these produced with Nvidia’s H800 chips, sources declare. Whereas Ant continues to make use of Nvidia chips for a few of its AI improvement, one sources stated the corporate is popping more and more to options from AMD and Chinese language chip-makers for its newest fashions.

The event indicators Ant’s deeper involvement within the rising AI race between Chinese language and US tech companies, significantly as corporations search for cost-effective methods to coach fashions. The experimentation with home {hardware} displays a broader effort amongst Chinese language companies to work round export restrictions that block entry to high-end chips like Nvidia’s H800, which, though not probably the most superior, continues to be one of many extra highly effective GPUs accessible to Chinese language organisations.

Ant has revealed a analysis paper describing its work, stating that its fashions, in some checks, carried out higher than these developed by Meta. Bloomberg News, which initially reported the matter, has not verified the corporate’s outcomes independently. If the fashions carry out as claimed, Ant’s efforts could symbolize a step ahead in China’s try to decrease the price of working AI functions and scale back the reliance on overseas {hardware}.

See also  Trump jokes about AI while US and UK sign new tech deal

MoE fashions divide duties into smaller knowledge units dealt with by separate elements, and have gained consideration amongst AI researchers and knowledge scientists. The method has been utilized by Google and the Hangzhou-based startup, DeepSeek. The MoE idea is just like having a staff of specialists, every dealing with a part of a process to make the method of manufacturing fashions extra environment friendly. Ant has declined to touch upon its work with respect to its {hardware} sources.

Coaching MoE fashions relies on high-performance GPUs which will be too costly for smaller corporations to amass or use. Ant’s analysis centered on lowering that price barrier. The paper’s title is suffixed with a transparent goal: Scaling Fashions “with out premium GPUs.” [our quotation marks]

The path taken by Ant and the usage of MoE to scale back coaching prices distinction with Nvidia’s strategy. CEO Officer Jensen Huang has stated that demand for computing energy will proceed to develop, even with the introduction of extra environment friendly fashions like DeepSeek’s R1. His view is that corporations will search extra highly effective chips to drive income development, reasonably than aiming to chop prices with cheaper options. Nvidia’s technique stays centered on constructing GPUs with extra cores, transistors, and reminiscence.

Based on the Ant Group paper, coaching one trillion tokens – the essential models of knowledge AI fashions use to be taught – price about 6.35 million yuan (roughly $880,000) utilizing typical high-performance {hardware}. The corporate’s optimised coaching methodology diminished that price to round 5.1 million yuan by utilizing lower-specification chips.

See also  Snowflake teams up with Mistral AI to integrate language models via Snowflake Cortex

Ant stated it plans to use its fashions produced on this approach – Ling-Plus and Ling-Lite – to industrial AI use circumstances like healthcare and finance. Earlier this yr, the corporate acquired Haodf.com, a Chinese language on-line medical platform, to additional Ant’s ambition to deploy AI-based options in healthcare. It additionally operates different AI companies, together with a digital assistant app referred to as Zhixiaobao and a monetary advisory platform generally known as Maxiaocai.

“For those who discover one level of assault to beat the world’s finest kung fu grasp, you may nonetheless say you beat them, which is why real-world software is vital,” stated Robin Yu, chief know-how officer of Beijing-based AI agency, Shengshang Tech.

Ant has made its fashions open supply. Ling-Lite has 16.8 billion parameters – settings that assist decide how a mannequin capabilities – whereas Ling-Plus has 290 billion. For comparability, estimates recommend closed-source GPT-4.5 has round 1.8 trillion parameters, in response to MIT Know-how Assessment.

Regardless of progress, Ant’s paper famous that coaching fashions stays difficult. Small changes to {hardware} or mannequin construction throughout mannequin coaching generally resulted in unstable efficiency, together with spikes in error charges.

(Photograph by Unsplash)

See additionally: DeepSeek V3-0324 tops non-reasoning AI fashions in open-source first

Wish to be taught extra about AI and large knowledge from trade leaders? Try AI & Big Data Expo going down in Amsterdam, California, and London. The great occasion is co-located with different main occasions together with Intelligent Automation Conference, BlockX, Digital Transformation Week, and Cyber Security & Cloud Expo.

Discover different upcoming enterprise know-how occasions and webinars powered by TechForge here.

See also  Baidu unveils proprietary ERNIE 5 beating GPT-5 performance on charts, document understanding and more

Source link

TAGGED: Ant, Chips, Costs, Cut, Domestic, Group, models, train
Share This Article
Twitter Email Copy Link Print
Previous Article Pulsant to acquire two data centres from SCC Pulsant to acquire two data centres from SCC
Next Article Redpanda Redpanda Raises $100M in Series D; Valued at $1 Billion
Leave a comment

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Your Trusted Source for Accurate and Timely Updates!

Our commitment to accuracy, impartiality, and delivering breaking news as it happens has earned us the trust of a vast audience. Stay ahead with real-time updates on the latest events, trends.
FacebookLike
TwitterFollow
InstagramFollow
YoutubeSubscribe
LinkedInFollow
MediumFollow
- Advertisement -
Ad image

Popular Posts

Cradlepoint targets wireless access for SMEs with new 5G router

Cradlepoint, an organization specializing in cloud-based LTE and 5G wi-fi edge community options, has launched…

February 20, 2024

The Most Innovative Companies in USA

For a lot of a long time, the USA has been a pioneering nation, which…

March 7, 2025

Making a case for case statements on Linux

#!/bin/bash echo -n "enter the variety of equal sides that the form has> " …

May 23, 2024

How to Recycle IT Equipment & Reduce Impact

A lot of the dialog surrounding sustainability and the IT trade immediately focuses on decreasing…

June 1, 2024

Dedagroup Acquires Quod Orbis

Dedagroup, a Trento, Italy-based IT participant, acquired Quod Orbit, a London, UK-based steady controls monitoring…

July 4, 2024

You Might Also Like

SuperCool review: Evaluating the reality of autonomous creation
AI

SuperCool review: Evaluating the reality of autonomous creation

By saad
Top 7 best AI penetration testing companies in 2026
AI

Top 7 best AI penetration testing companies in 2026

By saad
Intuit, Uber, and State Farm trial AI agents inside enterprise workflows
AI

Intuit, Uber, and State Farm trial enterprise AI agents

By saad
How separating logic and search boosts AI agent scalability
AI

How separating logic and search boosts AI agent scalability

By saad
Data Center News
Facebook Twitter Youtube Instagram Linkedin

About US

Data Center News: Stay informed on the pulse of data centers. Latest updates, tech trends, and industry insights—all in one place. Elevate your data infrastructure knowledge.

Top Categories
  • Global Market
  • Infrastructure
  • Innovations
  • Investments
Usefull Links
  • Home
  • Contact
  • Privacy Policy
  • Terms & Conditions

© 2024 – datacenternews.tech – All rights reserved

Welcome Back!

Sign in to your account

Lost your password?
We use cookies to ensure that we give you the best experience on our website. If you continue to use this site we will assume that you are happy with it.
You can revoke your consent any time using the Revoke consent button.