LG EXAONE Deep is a maths, science, and coding buff

Last updated: March 18, 2025 5:43 pm
Published March 18, 2025

LG AI Research has unveiled EXAONE Deep, a reasoning model that excels in complex problem-solving across maths, science, and coding.

The company highlighted the global challenge of creating advanced reasoning models, noting that currently only a handful of organisations with foundation models are actively pursuing this complex area. EXAONE Deep aims to compete directly with these leading models, showcasing a competitive level of reasoning ability.

LG AI Research has focused its efforts on dramatically improving EXAONE Deep’s reasoning capabilities in core domains. The model also demonstrates a strong ability to understand and apply knowledge across a broader range of subjects.

The performance benchmarks released by LG AI Research are impressive:

  • Maths: The EXAONE Deep 32B model outperformed a competing model, despite being only 5% of its size, in a demanding mathematics benchmark. Furthermore, the 7.8B and 2.4B versions achieved first place in all major mathematics benchmarks for their respective model sizes.
  • Science and coding: In these areas, the EXAONE Deep models (7.8B and 2.4B) have secured the top spot across all major benchmarks.
  • MMLU (Massive Multitask Language Understanding): The 32B model achieved a score of 83.0 on the MMLU benchmark, which LG AI Research claims is the best performance among domestic Korean models.

The capabilities of the EXAONE Deep 32B model have already garnered international recognition.

Shortly after its release, it was included in the ‘Notable AI Models’ list by US-based non-profit research organisation Epoch AI. This listing places EXAONE Deep alongside its predecessor, EXAONE 3.5, making LG the only Korean entity with models featured on this prestigious list in the past two years.


Maths prowess

EXAONE Deep has demonstrated exceptional mathematical reasoning skills across its various model sizes (32B, 7.8B, and 2.4B). In assessments based on the 2025 academic year’s mathematics curriculum, all three models outperformed global reasoning models of comparable size.

The 32B model achieved a score of 94.5 in a general mathematics competency test and 90.0 in the American Invitational Mathematics Examination (AIME) 2024, a qualifying exam for the US Mathematical Olympiad.

In the AIME 2025, the 32B model matched the performance of DeepSeek-R1, a significantly larger 671B model. This result showcases EXAONE Deep’s efficient learning and strong logical reasoning abilities, particularly when tackling challenging mathematical problems.
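
As a rough check on the size comparison, the sketch below computes the ratio of the two nominal parameter counts quoted above (32B and 671B). The function and variable names are illustrative, and the counts are taken at face value from the model labels, so this is arithmetic on published figures rather than a measurement.

```python
# Nominal parameter counts (in billions) as quoted in the article.
EXAONE_DEEP_32B = 32.0
DEEPSEEK_R1 = 671.0


def size_ratio_percent(small: float, large: float) -> float:
    """Return `small` expressed as a percentage of `large`."""
    if large <= 0:
        raise ValueError("large must be positive")
    return 100.0 * small / large


ratio = size_ratio_percent(EXAONE_DEEP_32B, DEEPSEEK_R1)
print(f"EXAONE Deep 32B is about {ratio:.1f}% the size of a 671B model")
```

The result is a little under 5%, which is in line with the "only 5% of its size" figure LG cites, although the article does not state that both claims refer to the same comparison.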

The smaller 7.8B and 2.4B models also achieved top rankings in major benchmarks for lightweight and on-device models, respectively. The 7.8B model scored 94.8 on the MATH-500 benchmark and 59.6 on AIME 2025, while the 2.4B model achieved scores of 92.3 and 47.9 in the same evaluations.

Science and coding excellence

EXAONE Deep has also showcased remarkable capabilities in professional science reasoning and software coding.

The 32B model scored 66.1 on the GPQA Diamond test, which assesses problem-solving skills in doctoral-level physics, chemistry, and biology. In the LiveCodeBench evaluation, which measures coding proficiency, the model achieved a score of 59.5, indicating its potential for high-level applications in these expert domains.

The 7.8B and 2.4B models continued this trend of strong performance, both securing first place in the GPQA Diamond and LiveCodeBench benchmarks within their respective size categories. This achievement builds upon the success of the EXAONE 3.5 2.4B model, which previously topped Hugging Face’s LLM Leaderboard in the edge division.


Enhanced general knowledge

Beyond its specialised reasoning capabilities, EXAONE Deep has also demonstrated improved performance in general knowledge understanding.

The 32B model achieved an impressive score of 83.0 on the MMLU benchmark, positioning it as the top-performing domestic model in this comprehensive evaluation. This suggests that EXAONE Deep’s reasoning enhancements extend beyond specific domains and contribute to a broader understanding of various subjects.

LG AI Research believes that EXAONE Deep’s reasoning advancements represent a leap towards a future where AI can tackle increasingly complex problems and contribute to enriching and simplifying human lives through continuous research and innovation.
