Sunday, 1 Mar 2026
Subscribe
logo
  • Global
  • AI
  • Cloud Computing
  • Edge Computing
  • Security
  • Investment
  • Sustainability
  • More
    • Colocation
    • Quantum Computing
    • Regulation & Policy
    • Infrastructure
    • Power & Cooling
    • Design
    • Innovations
    • Blog
Font ResizerAa
Data Center NewsData Center News
Search
  • Global
  • AI
  • Cloud Computing
  • Edge Computing
  • Security
  • Investment
  • Sustainability
  • More
    • Colocation
    • Quantum Computing
    • Regulation & Policy
    • Infrastructure
    • Power & Cooling
    • Design
    • Innovations
    • Blog
Have an existing account? Sign In
Follow US
© 2022 Foxiz News Network. Ruby Design Company. All Rights Reserved.
Data Center News > Blog > AI > Tencent releases versatile open-source Hunyuan AI models
AI

Tencent releases versatile open-source Hunyuan AI models

Last updated: August 4, 2025 7:56 pm
Published August 4, 2025
Share
Tencent releases versatile open-source Hunyuan AI models
SHARE

Tencent has expanded its household of open-source Hunyuan AI fashions which are versatile sufficient for broad use. This new household of fashions is engineered to ship highly effective efficiency throughout computational environments, from small edge units to demanding, high-concurrency manufacturing methods.

The discharge features a complete set of pre-trained and instruction-tuned fashions out there on the developer platform Hugging Face. The fashions are available in a number of sizes, particularly with parameter scales of 0.5B, 1.8B, 4B, and 7B, offering substantial flexibility for builders and companies.

Tencent has indicated that these fashions have been developed utilizing coaching methods much like its extra highly effective Hunyuan-A13B mannequin, permitting them to inherit its efficiency traits. This method allows customers to pick the optimum mannequin for his or her wants, whether or not it’s a smaller variant for resource-constrained edge computing or a bigger mannequin for high-throughput manufacturing workloads, all whereas making certain robust capabilities.

Probably the most notable options of the Hunyuan sequence is its native assist for an ultra-long 256K context window. This enables the fashions to deal with and keep steady efficiency on long-text duties, a significant functionality for complicated doc evaluation, prolonged conversations, and in-depth content material technology. The fashions assist what Tencent calls “hybrid reasoning,” which permits for each quick and sluggish pondering modes that customers can select between relying on their particular necessities.

The corporate has additionally positioned a robust emphasis on agentic capabilities. The fashions have been optimised for agent-based duties and have demonstrated main outcomes on established benchmarks corresponding to BFCL-v3, τ-Bench, and C3-Bench, suggesting a excessive diploma of proficiency in complicated, multi-step problem-solving. As an illustration, on the C3-Bench, the Hunyuan-7B-Instruct mannequin achieves a rating of 68.5, whereas the Hunyuan-4B-Instruct mannequin scores 64.3.

See also  Why we need to move away from anthropomorphic naming conventions in AI

The sequence’ efficiency is a deal with environment friendly inference. Tencent’s Hunyuan fashions utilise Grouped Question Consideration (GQA), a method identified for enhancing processing velocity and decreasing computational overhead. This effectivity is additional enhanced by superior quantisation assist, a key component of the Hunyuan structure designed to decrease deployment limitations.

Tencent has developed its personal compression toolset, AngleSlim, to create a extra user-friendly and efficient mannequin compression resolution. Utilizing this instrument, the corporate affords two most important varieties of quantisation for the Hunyuan sequence.

The primary is FP8 static quantisation, which employs an 8-bit floating-point format. This methodology makes use of a small quantity of calibration information to pre-determine the quantisation scale with out requiring full retraining, changing mannequin weights and activation values into the FP8 format to spice up inference effectivity.

The second methodology is INT4 quantisation, which achieves W4A16 quantisation by the GPTQ and AWQ algorithms:

  • The GPTQ method processes mannequin weights layer by layer, utilizing calibration information to minimise errors within the quantised weights. This course of avoids requiring mannequin retraining and improves inference velocity.
  • The AWQ algorithm works by statistically analysing the amplitude of activation values from a small set of calibration information. It then calculates a scaling coefficient for every weight channel, which expands the numerical vary of vital weights to retain extra data through the compression course of. 

Builders can both use the AngleSlim instrument themselves or obtain the pre-quantised fashions instantly.

Efficiency benchmarks verify the robust capabilities of the Tencent Hunyuan fashions throughout a variety of duties. The pre-trained Hunyuan-7B mannequin, for instance, achieves a rating of 79.82 on the MMLU benchmark, 88.25 on GSM8K, and 74.85 on the MATH benchmark, demonstrating stable reasoning and mathematical expertise.

See also  Stability AI releases Stable Code 3B for enhanced coding assistance

The instruction-tuned variants present spectacular leads to specialised areas. In arithmetic, the Hunyuan-7B-Instruct mannequin scores 81.1 on the AIME 2024 benchmark, whereas the 4B model scores 78.3. In science, the 7B mannequin reaches 76.5 on OlympiadBench, and in coding, it scores 42 on Livecodebench.

🚀We’re increasing the Tencent Hunyuan open-source LLM ecosystem with 4 compact fashions (0.5B, 1.8B, 4B, 7B)! Designed for low-power eventualities like consumer-grade GPUs, good automobiles, good residence units, cell phones, and PCs, these fashions assist cost-effective fine-tuning… pic.twitter.com/CknskVqPem

— Hunyuan (@TencentHunyuan) August 4, 2025

The quantisation benchmarks present minimal efficiency degradation. On the DROP benchmark, the Hunyuan-7B-Instruct mannequin scores 85.9 in its base B16 format, 86.0 with FP8, and 85.7 with Int4 GPTQ, indicating that effectivity beneficial properties don’t come at a price to accuracy.

For deployment, Tencent recommends utilizing established frameworks like TensorRT-LLM, vLLM, or SGLang to serve the Hunyuan fashions and create OpenAI-compatible API endpoints, making certain they are often built-in easily into current improvement workflows. This mix of efficiency, effectivity, and deployment flexibility positions the Hunyuan sequence as a seamless highly effective contender in open-source AI.

See additionally: Deep Cogito v2: Open-source AI that hones its reasoning expertise

Need to be taught extra about AI and large information from trade leaders? Take a look at AI & Big Data Expo going down in Amsterdam, California, and London. The excellent occasion is co-located with different main occasions together with Intelligent Automation Conference, BlockX, Digital Transformation Week, and Cyber Security & Cloud Expo.

Discover different upcoming enterprise know-how occasions and webinars powered by TechForge here.

See also  ElevenLabs adds AI voice of celebs to new digital narrator — but is it safe?



Source link

TAGGED: Hunyuan, models, opensource, releases, Tencent, Versatile
Share This Article
Twitter Email Copy Link Print
Previous Article Hyve Managed Hosting partners with Digital Realty to expand global operations Hyve Managed Hosting partners with Digital Realty to expand global operations
Next Article Is the EU AI Act a step in the right direction? Is the EU AI Act a step in the right direction?
Leave a comment

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Your Trusted Source for Accurate and Timely Updates!

Our commitment to accuracy, impartiality, and delivering breaking news as it happens has earned us the trust of a vast audience. Stay ahead with real-time updates on the latest events, trends.
FacebookLike
TwitterFollow
InstagramFollow
YoutubeSubscribe
LinkedInFollow
MediumFollow
- Advertisement -
Ad image

Popular Posts

Cape Raises $61M in Funding

Cape, an Arlington, VA-based privacy-first cellular provider, raised $61M in funding. The spherical was led…

April 21, 2024

Blackcore Launches ACE Overclocked Servers Powered by AMD CPUs

- Blackcore® Applied sciences, a number one producer of high-performance, dependable, overclocked servers, in the…

October 21, 2024

Snyk Debuts AppRisk Pro to Enhance Application Security with AI Integration

World supplier of developer safety options, Snyk, has launched Snyk AppRisk Professional, an answer that mixes…

May 3, 2024

NVIDIA Debuts Quantum Computing Cloud Services Platform

With the assistance of a brand new cloud service from NVIDIA, scientists might now discover…

March 21, 2024

Partsol Raises $17.5M in Funding

Partsol, a Tampa, FL-based Cognitive AI firm and developer of the proprietary Absolute Fact algorithms,…

November 18, 2024

You Might Also Like

ASML's high-NA EUV tools clear the runway for next-gen AI chips
AI

ASML’s high-NA EUV tools clear the runway for next-gen AI chips

By saad
Poor implementation of AI may be behind workforce reduction
AI

Poor implementation of AI may be behind workforce reduction

By saad
Upgrading agentic AI for finance workflows
AI

Upgrading agentic AI for finance workflows

By saad
Goldman Sachs and Deutsche Bank test agentic AI for trade surveillance
AI

Goldman Sachs and Deutsche Bank test agentic AI in trading

By saad
Data Center News
Facebook Twitter Youtube Instagram Linkedin

About US

Data Center News: Stay informed on the pulse of data centers. Latest updates, tech trends, and industry insights—all in one place. Elevate your data infrastructure knowledge.

Top Categories
  • Global Market
  • Infrastructure
  • Innovations
  • Investments
Usefull Links
  • Home
  • Contact
  • Privacy Policy
  • Terms & Conditions

© 2024 – datacenternews.tech – All rights reserved

Welcome Back!

Sign in to your account

Lost your password?
We use cookies to ensure that we give you the best experience on our website. If you continue to use this site we will assume that you are happy with it.
You can revoke your consent any time using the Revoke consent button.