Saturday, 7 Feb 2026
Subscribe
logo
  • Global
  • AI
  • Cloud Computing
  • Edge Computing
  • Security
  • Investment
  • Sustainability
  • More
    • Colocation
    • Quantum Computing
    • Regulation & Policy
    • Infrastructure
    • Power & Cooling
    • Design
    • Innovations
    • Blog
Font ResizerAa
Data Center NewsData Center News
Search
  • Global
  • AI
  • Cloud Computing
  • Edge Computing
  • Security
  • Investment
  • Sustainability
  • More
    • Colocation
    • Quantum Computing
    • Regulation & Policy
    • Infrastructure
    • Power & Cooling
    • Design
    • Innovations
    • Blog
Have an existing account? Sign In
Follow US
© 2022 Foxiz News Network. Ruby Design Company. All Rights Reserved.
Data Center News > Blog > AI > Tencent releases versatile open-source Hunyuan AI models
AI

Tencent releases versatile open-source Hunyuan AI models

Last updated: August 4, 2025 7:56 pm
Published August 4, 2025
Share
Tencent releases versatile open-source Hunyuan AI models
SHARE

Tencent has expanded its household of open-source Hunyuan AI fashions which are versatile sufficient for broad use. This new household of fashions is engineered to ship highly effective efficiency throughout computational environments, from small edge units to demanding, high-concurrency manufacturing methods.

The discharge features a complete set of pre-trained and instruction-tuned fashions out there on the developer platform Hugging Face. The fashions are available in a number of sizes, particularly with parameter scales of 0.5B, 1.8B, 4B, and 7B, offering substantial flexibility for builders and companies.

Tencent has indicated that these fashions have been developed utilizing coaching methods much like its extra highly effective Hunyuan-A13B mannequin, permitting them to inherit its efficiency traits. This method allows customers to pick the optimum mannequin for his or her wants, whether or not it’s a smaller variant for resource-constrained edge computing or a bigger mannequin for high-throughput manufacturing workloads, all whereas making certain robust capabilities.

Probably the most notable options of the Hunyuan sequence is its native assist for an ultra-long 256K context window. This enables the fashions to deal with and keep steady efficiency on long-text duties, a significant functionality for complicated doc evaluation, prolonged conversations, and in-depth content material technology. The fashions assist what Tencent calls “hybrid reasoning,” which permits for each quick and sluggish pondering modes that customers can select between relying on their particular necessities.

The corporate has additionally positioned a robust emphasis on agentic capabilities. The fashions have been optimised for agent-based duties and have demonstrated main outcomes on established benchmarks corresponding to BFCL-v3, τ-Bench, and C3-Bench, suggesting a excessive diploma of proficiency in complicated, multi-step problem-solving. As an illustration, on the C3-Bench, the Hunyuan-7B-Instruct mannequin achieves a rating of 68.5, whereas the Hunyuan-4B-Instruct mannequin scores 64.3.

See also  DeepSeek to open-source AGI research amid privacy concerns

The sequence’ efficiency is a deal with environment friendly inference. Tencent’s Hunyuan fashions utilise Grouped Question Consideration (GQA), a method identified for enhancing processing velocity and decreasing computational overhead. This effectivity is additional enhanced by superior quantisation assist, a key component of the Hunyuan structure designed to decrease deployment limitations.

Tencent has developed its personal compression toolset, AngleSlim, to create a extra user-friendly and efficient mannequin compression resolution. Utilizing this instrument, the corporate affords two most important varieties of quantisation for the Hunyuan sequence.

The primary is FP8 static quantisation, which employs an 8-bit floating-point format. This methodology makes use of a small quantity of calibration information to pre-determine the quantisation scale with out requiring full retraining, changing mannequin weights and activation values into the FP8 format to spice up inference effectivity.

The second methodology is INT4 quantisation, which achieves W4A16 quantisation by the GPTQ and AWQ algorithms:

  • The GPTQ method processes mannequin weights layer by layer, utilizing calibration information to minimise errors within the quantised weights. This course of avoids requiring mannequin retraining and improves inference velocity.
  • The AWQ algorithm works by statistically analysing the amplitude of activation values from a small set of calibration information. It then calculates a scaling coefficient for every weight channel, which expands the numerical vary of vital weights to retain extra data through the compression course of. 

Builders can both use the AngleSlim instrument themselves or obtain the pre-quantised fashions instantly.

Efficiency benchmarks verify the robust capabilities of the Tencent Hunyuan fashions throughout a variety of duties. The pre-trained Hunyuan-7B mannequin, for instance, achieves a rating of 79.82 on the MMLU benchmark, 88.25 on GSM8K, and 74.85 on the MATH benchmark, demonstrating stable reasoning and mathematical expertise.

See also  Gartner Data & Analytics Summit unveils expanded AI agenda for 2026

The instruction-tuned variants present spectacular leads to specialised areas. In arithmetic, the Hunyuan-7B-Instruct mannequin scores 81.1 on the AIME 2024 benchmark, whereas the 4B model scores 78.3. In science, the 7B mannequin reaches 76.5 on OlympiadBench, and in coding, it scores 42 on Livecodebench.

🚀We’re increasing the Tencent Hunyuan open-source LLM ecosystem with 4 compact fashions (0.5B, 1.8B, 4B, 7B)! Designed for low-power eventualities like consumer-grade GPUs, good automobiles, good residence units, cell phones, and PCs, these fashions assist cost-effective fine-tuning… pic.twitter.com/CknskVqPem

— Hunyuan (@TencentHunyuan) August 4, 2025

The quantisation benchmarks present minimal efficiency degradation. On the DROP benchmark, the Hunyuan-7B-Instruct mannequin scores 85.9 in its base B16 format, 86.0 with FP8, and 85.7 with Int4 GPTQ, indicating that effectivity beneficial properties don’t come at a price to accuracy.

For deployment, Tencent recommends utilizing established frameworks like TensorRT-LLM, vLLM, or SGLang to serve the Hunyuan fashions and create OpenAI-compatible API endpoints, making certain they are often built-in easily into current improvement workflows. This mix of efficiency, effectivity, and deployment flexibility positions the Hunyuan sequence as a seamless highly effective contender in open-source AI.

See additionally: Deep Cogito v2: Open-source AI that hones its reasoning expertise

Need to be taught extra about AI and large information from trade leaders? Take a look at AI & Big Data Expo going down in Amsterdam, California, and London. The excellent occasion is co-located with different main occasions together with Intelligent Automation Conference, BlockX, Digital Transformation Week, and Cyber Security & Cloud Expo.

Discover different upcoming enterprise know-how occasions and webinars powered by TechForge here.

See also  AI in business intelligence: Caveat emptor



Source link

TAGGED: Hunyuan, models, opensource, releases, Tencent, Versatile
Share This Article
Twitter Email Copy Link Print
Previous Article Hyve Managed Hosting partners with Digital Realty to expand global operations Hyve Managed Hosting partners with Digital Realty to expand global operations
Next Article Is the EU AI Act a step in the right direction? Is the EU AI Act a step in the right direction?
Leave a comment

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Your Trusted Source for Accurate and Timely Updates!

Our commitment to accuracy, impartiality, and delivering breaking news as it happens has earned us the trust of a vast audience. Stay ahead with real-time updates on the latest events, trends.
FacebookLike
TwitterFollow
InstagramFollow
YoutubeSubscribe
LinkedInFollow
MediumFollow
- Advertisement -
Ad image

Popular Posts

PrettyDamnQuick Raises $25M in Series A Funding

PrettyDamnQuick, a NYC-based supplier of an operational information platform for worthwhile commerce, raised $25M in…

January 12, 2025

LogicFlo Raises $2.7M Funding

LogicFlo, a Boston, MA-based supplier of an AI agent platform for life sciences organizations, raised…

June 30, 2025

Self-positioning microdevices with circularly polarized luminescence enable adaptable 3D display

Schematic illustration of the adaptable spatial show and depth info interplay with high-performance CPL. Credit…

May 20, 2025

Uptime Institute launches Data Center Academy in Kingdom of Saudi Arabia

The Uptime Institute Knowledge Middle Academy goals to construct and strengthen a sustainable ecosystem for…

March 14, 2024

The Australia Data Center Market Size Will Witness Investments

AUSTRALIA DATA CENTER MARKET - INVESTMENT ANALYSIS & GROWTH OPPORTUNITIES 2024-2029In accordance with Arizton's newest…

May 21, 2024

You Might Also Like

SuperCool review: Evaluating the reality of autonomous creation
AI

SuperCool review: Evaluating the reality of autonomous creation

By saad
Top 7 best AI penetration testing companies in 2026
AI

Top 7 best AI penetration testing companies in 2026

By saad
Intuit, Uber, and State Farm trial AI agents inside enterprise workflows
AI

Intuit, Uber, and State Farm trial enterprise AI agents

By saad
How separating logic and search boosts AI agent scalability
AI

How separating logic and search boosts AI agent scalability

By saad
Data Center News
Facebook Twitter Youtube Instagram Linkedin

About US

Data Center News: Stay informed on the pulse of data centers. Latest updates, tech trends, and industry insights—all in one place. Elevate your data infrastructure knowledge.

Top Categories
  • Global Market
  • Infrastructure
  • Innovations
  • Investments
Usefull Links
  • Home
  • Contact
  • Privacy Policy
  • Terms & Conditions

© 2024 – datacenternews.tech – All rights reserved

Welcome Back!

Sign in to your account

Lost your password?
We use cookies to ensure that we give you the best experience on our website. If you continue to use this site we will assume that you are happy with it.
You can revoke your consent any time using the Revoke consent button.