Friday, 20 Mar 2026
Subscribe
logo
  • Global
  • AI
  • Cloud Computing
  • Edge Computing
  • Security
  • Investment
  • Sustainability
  • More
    • Colocation
    • Quantum Computing
    • Regulation & Policy
    • Infrastructure
    • Power & Cooling
    • Design
    • Innovations
    • Blog
Font ResizerAa
Data Center NewsData Center News
Search
  • Global
  • AI
  • Cloud Computing
  • Edge Computing
  • Security
  • Investment
  • Sustainability
  • More
    • Colocation
    • Quantum Computing
    • Regulation & Policy
    • Infrastructure
    • Power & Cooling
    • Design
    • Innovations
    • Blog
Have an existing account? Sign In
Follow US
© 2022 Foxiz News Network. Ruby Design Company. All Rights Reserved.
Data Center News > Blog > AI > Tencent Hunyuan Video-Foley brings lifelike audio to AI video
AI

Tencent Hunyuan Video-Foley brings lifelike audio to AI video

Last updated: August 29, 2025 11:26 am
Published August 29, 2025
Share
Tencent Hunyuan Video-Foley brings lifelike audio to AI video
SHARE

A staff at Tencent’s Hunyuan lab has created a brand new AI, ‘Hunyuan Video-Foley,’ that lastly brings lifelike audio to generated video. It’s designed to hearken to movies and generate a high-quality soundtrack that’s completely in sync with the motion on display.

Ever watched an AI-generated video and felt like one thing was lacking? The visuals could be beautiful, however they typically have an eerie silence that breaks the spell. Within the movie trade, the sound that fills that silence – the rustle of leaves, the clap of thunder, the clink of a glass – is named Foley artwork, and it’s a painstaking craft carried out by consultants.

Matching that stage of element is a large problem for AI. For years, automated programs have struggled to create plausible sounds for movies.

How is Tencent fixing the AI-generated audio for video drawback?

One of many greatest causes video-to-audio (V2A) fashions typically fell brief within the sound division was what the researchers name “modality imbalance”. Primarily, the AI was listening extra to the textual content prompts it was given than it was watching the precise video.

As an example, in the event you gave a mannequin a video of a busy seaside with individuals strolling and seagulls flying, however the textual content immediate solely stated “the sound of ocean waves,” you’d possible simply get the sound of waves. The AI would fully ignore the footsteps within the sand and the calls of the birds, making the scene really feel lifeless.

On high of that, the standard of the audio was typically subpar, and there merely wasn’t sufficient high-quality video with sound to coach the fashions successfully.

See also  Tencent Cloud unveils AIoT 2.0 to integrate multimodal AI in global smart devices

Tencent’s Hunyuan staff tackled these issues from three totally different angles:

  1. Tencent realised the AI wanted a greater training, in order that they constructed a large, 100,000-hour library of video, audio, and textual content descriptions for it to study from. They created an automatic pipeline that filtered out low-quality content material from the web, eliminating clips with lengthy silences or compressed, fuzzy audio, guaranteeing the AI discovered from the absolute best materials.
  1. They designed a better structure for the AI. Consider it like instructing the mannequin to correctly multitask. The system first pays extremely shut consideration to the visual-audio hyperlink to get the timing excellent—like matching the thump of a footstep to the precise second a shoe hits the pavement. As soon as it has that timing locked down, it then incorporates the textual content immediate to grasp the general temper and context of the scene. This twin strategy ensures the particular particulars of the video are by no means ignored.
  1. To ensure the sound was high-quality, they used a coaching technique referred to as Illustration Alignment (REPA). That is like having an knowledgeable audio engineer continuously wanting over the AI’s shoulder throughout its coaching. It compares the AI’s work to options from a pre-trained, professional-grade audio mannequin to information it in direction of producing cleaner, richer, and extra steady sound.

At the moment we’re saying the open-source launch of HunyuanVideo-Foley, our new end-to-end Textual content-Video-to-Audio (TV2A) framework for producing high-fidelity audio.🚀

This device empowers creators in video manufacturing, filmmaking, and sport improvement to generate professional-grade… pic.twitter.com/mff2m5xFvC

— Hunyuan (@TencentHunyuan) August 28, 2025

The outcomes communicate sound for themselves

When Tencent examined Hunyuan Video-Foley in opposition to different main AI fashions, the audio outcomes have been clear. It wasn’t simply that the computer-based metrics have been higher; human listeners constantly rated its output as larger high quality, higher matched to the video, and extra precisely timed.

See also  From hallucinations to hardware: Lessons from a real-world computer vision project gone sideways

Throughout the board, the AI delivered enhancements in making the sound match the on-screen motion, each by way of content material and timing. The outcomes throughout a number of analysis datasets assist this:

Evaluation results of Tencent Hunyuan Video-Foley against other leading AI models.

Tencent’s work helps to shut the hole between silent AI movies and an immersive viewing expertise with high quality audio. It’s bringing the magic of Foley artwork to the world of automated content material creation, which could possibly be a strong functionality for filmmakers, animators, and creators in all places.

See additionally: Google Vids will get AI avatars and image-to-video instruments

Banner for the AI & Big Data Expo event series.

Need to study extra about AI and massive knowledge from trade leaders? Take a look at AI & Big Data Expo happening in Amsterdam, California, and London. The excellent occasion is a part of TechEx and is co-located with different main know-how occasions, click on here for extra info.

AI Information is powered by TechForge Media. Discover different upcoming enterprise know-how occasions and webinars here.



Source link

TAGGED: audio, brings, Hunyuan, lifelike, Tencent, Video, VideoFoley
Share This Article
Twitter Email Copy Link Print
Previous Article The AI breakthrough that uses almost no power to create images The AI breakthrough that uses almost no power to create images
Next Article Outline planning approved for data centre in Hemel Hempstead Outline planning approved for data centre in Hemel Hempstead
Leave a comment

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Your Trusted Source for Accurate and Timely Updates!

Our commitment to accuracy, impartiality, and delivering breaking news as it happens has earned us the trust of a vast audience. Stay ahead with real-time updates on the latest events, trends.
FacebookLike
TwitterFollow
InstagramFollow
YoutubeSubscribe
LinkedInFollow
MediumFollow
- Advertisement -
Ad image

Popular Posts

XrossRoad Announces Strategic Partnership with Berachain to Expand Japanese IP in Web3

Group-driven IP platform “Xross Road” and superior AI expertise supplier “Allora Network” have shaped a…

February 18, 2025

Additive manufactured aluminum alloys for space optical instruments

A mounted plastic prototype of the Compact Hyperspectral Air Air pollution Sensor Demonstrator (CHAPS-D) instrument,…

March 24, 2024

Tennr Raises $101M in Series C Funding

Cofounders Trey Holterman, Tyler Johnson, and Diego Baugh Tennr, a NYC-based supplier of an orchestration…

June 19, 2025

Aetherflux joins the race to launch orbital data centers by 2027

Enterprises will hook up with and handle orbital workloads “the identical approach they handle cloud…

December 13, 2025

NTT Expanding Data Centers in North Texas

Worldwide telecommunications agency NTT Information is increasing its footprint in North Texas with an enormous…

May 3, 2024

You Might Also Like

NVIDIA Agent Toolkit Gives Enterprises a Framework to Deploy AI Agents at Scale
AI

NVIDIA Agent Toolkit Gives Enterprises a Framework to Deploy AI Agents at Scale

By saad
Visa prepares payment systems for AI agent-initiated transactions
AI

Visa prepares payment systems for AI agent-initiated transactions

By saad
For effective AI, insurance needs to get its data house in order
AI

For effective AI, insurance needs to get its data house in order

By saad
Mastercard keeps tabs on fraud with new foundation model
AI

Mastercard keeps tabs on fraud with new foundation model

By saad
Data Center News
Facebook Twitter Youtube Instagram Linkedin

About US

Data Center News: Stay informed on the pulse of data centers. Latest updates, tech trends, and industry insights—all in one place. Elevate your data infrastructure knowledge.

Top Categories
  • Global Market
  • Infrastructure
  • Innovations
  • Investments
Usefull Links
  • Home
  • Contact
  • Privacy Policy
  • Terms & Conditions

© 2024 – datacenternews.tech – All rights reserved

Welcome Back!

Sign in to your account

Lost your password?
We use cookies to ensure that we give you the best experience on our website. If you continue to use this site we will assume that you are happy with it.
You can revoke your consent any time using the Revoke consent button.