Monday, 12 May 2025
Subscribe
logo
  • Global
  • AI
  • Cloud Computing
  • Edge Computing
  • Security
  • Investment
  • Sustainability
  • More
    • Colocation
    • Quantum Computing
    • Regulation & Policy
    • Infrastructure
    • Power & Cooling
    • Design
    • Innovations
    • Blog
Font ResizerAa
Data Center NewsData Center News
Search
  • Global
  • AI
  • Cloud Computing
  • Edge Computing
  • Security
  • Investment
  • Sustainability
  • More
    • Colocation
    • Quantum Computing
    • Regulation & Policy
    • Infrastructure
    • Power & Cooling
    • Design
    • Innovations
    • Blog
Have an existing account? Sign In
Follow US
© 2022 Foxiz News Network. Ruby Design Company. All Rights Reserved.
Data Center News > Blog > AI > Tencent’s EzAudio AI transforms text to lifelike sound, sparking innovation and debate
AI

Tencent’s EzAudio AI transforms text to lifelike sound, sparking innovation and debate

Last updated: September 22, 2024 5:36 pm
Published September 22, 2024
Share
Tencent's EzAudio AI transforms text to lifelike sound, sparking innovation and debate
SHARE

Be a part of our every day and weekly newsletters for the newest updates and unique content material on industry-leading AI protection. Be taught Extra


Researchers from Johns Hopkins University and Tencent AI Lab have launched EzAudio, a brand new text-to-audio (T2A) era mannequin that guarantees to ship high-quality sound results from textual content prompts with unprecedented effectivity. This development marks a big leap in synthetic intelligence and audio know-how, addressing a number of key challenges in AI-generated audio.

EzAudio operates within the latent house of audio waveforms, departing from the normal technique of utilizing spectrograms. “This innovation permits for prime temporal decision whereas eliminating the necessity for a further neural vocoder,” the researchers state of their paper printed on the project’s website.

Reworking audio AI: How EzAudio-DiT works

The mannequin’s structure, dubbed EzAudio-DiT (Diffusion Transformer), incorporates a number of technical improvements to boost efficiency and effectivity. These embody a brand new adaptive layer normalization method referred to as AdaLN-SOLA, long-skip connections, and the mixing of superior positioning strategies like RoPE (Rotary Place Embedding).

“EzAudio produces extremely real looking audio samples, outperforming present open-source fashions in each goal and subjective evaluations,” the researchers declare. In comparative assessments, EzAudio demonstrated superior efficiency throughout a number of metrics, together with Frechet Distance (FD), Kullback-Leibler (KL) divergence, and Inception Score (IS).

AI audio market heats up: EzAudio’s potential impression

The discharge of EzAudio comes at a time when the AI audio era market is experiencing fast progress. ElevenLabs, a distinguished participant within the discipline, just lately launched an iOS app for text-to-speech conversion, signaling rising client curiosity in AI audio instruments. In the meantime, tech giants like Microsoft and Google proceed to take a position closely in AI voice simulation applied sciences.

See also  When is ART useful? When it's IBM's Adversarial Robustness Toolbox for AI

Gartner predicts that by 2027, 40% of generative AI options might be multimodal, combining textual content, picture, and audio capabilities. This development means that fashions like EzAudio, which deal with high-quality audio era, may play an important position within the evolving AI panorama.

Nevertheless, the widespread adoption of AI within the office just isn’t with out issues. A current Deloitte study discovered that nearly half of all staff are nervous about shedding their jobs to AI. Paradoxically, the examine additionally revealed that those that use AI extra regularly at work are extra involved about job safety.

Moral AI audio: Navigating the way forward for voice know-how

As AI audio era turns into extra refined, questions of ethics and accountable use come to the forefront. The flexibility to generate real looking audio from textual content prompts raises issues about potential misuse, such because the creation of deepfakes or unauthorized voice cloning.

The EzAudio workforce has made their code, dataset, and mannequin checkpoints publicly available, emphasizing transparency and inspiring additional analysis within the discipline. This open method may speed up developments in AI audio know-how whereas additionally permitting for broader scrutiny of potential dangers and advantages.

Wanting forward, the researchers recommend that EzAudio may have purposes past sound impact era, together with voice and music manufacturing. Because the know-how matures, it could discover use in industries starting from leisure and media to accessibility providers and digital assistants.

EzAudio marks a pivotal second in AI-generated audio, providing unprecedented high quality and effectivity. Its potential purposes span leisure, accessibility, and digital assistants. Nevertheless, this breakthrough additionally amplifies moral issues round deepfakes and voice cloning. As AI audio know-how races ahead, the problem lies in harnessing its potential whereas safeguarding towards misuse. The way forward for sound is right here — however are we able to face the music?

See also  Silicon Valley shaken as open-source AI models Llama 3.1 and Mistral Large 2 match industry leaders

Source link
TAGGED: debate, EzAudio, innovation, lifelike, sound, sparking, Tencents, text, transforms
Share This Article
Twitter Email Copy Link Print
Previous Article 97% of AI leaders commit to responsible AI 97% of AI leaders commit to responsible AI
Next Article Telstra Achieves 1.6Tb/s Over 700km Using Ciena’s WL6e Telstra Achieves 1.6Tb/s Over 700km Using Ciena’s WL6e
Leave a comment

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Your Trusted Source for Accurate and Timely Updates!

Our commitment to accuracy, impartiality, and delivering breaking news as it happens has earned us the trust of a vast audience. Stay ahead with real-time updates on the latest events, trends.
FacebookLike
TwitterFollow
InstagramFollow
YoutubeSubscribe
LinkedInFollow
MediumFollow
- Advertisement -
Ad image

Popular Posts

Synergy Spine Solutions Raises $30M Financing

Synergy Spine Solutions, a Louisville, CO-based medical device developer, raised $30M in funding. The round…

February 1, 2024

Akamai Secures Edgio Patents and Contracts in Strategic Acquisition Deal

International CDN, cybersecurity and cloud computing firm Akamai Applied sciences (NASDAQ: AKAM) has emerged because…

November 25, 2024

Validose Raises $2M in Pre-Seed Funding

Validose, a NYC-based precision medicine supply firm, raised $2M in Pre-Seed funding. The spherical was…

February 23, 2025

Cradlepoint unveils high-speed 5G router for retail pop-ups and small offices

Cradlepoint, a cloud-delivered 5G and LTE wireless network edge solution provider, has launched the E100…

January 23, 2024

Industry Insiders’ Cloud and Edge Computing Predictions for 2024 | DCN

Last year, FinOps — a strategy for achieving cloud cost optimization — garnered a lot…

January 24, 2024

You Might Also Like

Quantum, blue glow, 3D image
Global Market

Quantum computing gets an error-correction boost from AI innovation

By saad
Mem0's scalable memory promises more reliable AI agents that remembers context across lengthy conversations
AI

Mem0’s scalable memory promises more reliable AI agents that remembers context across lengthy conversations

By saad
The walled garden cracks: Nadella bets Microsoft’s Copilots—and Azure’s next act—on A2A/MCP interoperability
AI

The walled garden cracks: Nadella bets Microsoft’s Copilots—and Azure’s next act—on A2A/MCP interoperability

By saad
From silicon to sentience: The legacy guiding AI's next frontier and human cognitive migration
AI

From silicon to sentience: The legacy guiding AI’s next frontier and human cognitive migration

By saad
Data Center News
Facebook Twitter Youtube Instagram Linkedin

About US

Data Center News: Stay informed on the pulse of data centers. Latest updates, tech trends, and industry insights—all in one place. Elevate your data infrastructure knowledge.

Top Categories
  • Global Market
  • Infrastructure
  • Innovations
  • Investments
Usefull Links
  • Home
  • Contact
  • Privacy Policy
  • Terms & Conditions

© 2024 – datacenternews.tech – All rights reserved

Welcome Back!

Sign in to your account

Lost your password?
We use cookies to ensure that we give you the best experience on our website. If you continue to use this site we will assume that you are happy with it.OkNoPrivacy policy
You can revoke your consent any time using the Revoke consent button.Revoke consent