Don’t miss OpenAI, Chevron, Nvidia, Kaiser Permanente, and Capital One leaders solely at VentureBeat Rework 2024. Acquire important insights about GenAI and broaden your community at this unique three day occasion. Study Extra
Per week in the past, ElevenLabs, the AI voice startup based by former Google and Palantir engineers, made headlines with its first main consumer-centric product – a Reader app.
Presently available on iOS, the product is a devoted voiceover resolution that converts any textual content file or hyperlink from the net into AI audio, narrated in several AI voices and accents. At this time, the corporate introduced it’s increasing this library of voices on the app to incorporate AI voices of late Hollywood celebs Judy Garland, James Dean, Burt Reynolds and Sir Laurence Olivier.
The corporate has partnered with CMG Worldwide, the agency managing and defending the mental property rights of dwelling and deceased celebrities, to recreate and launch the enduring voices. Moreover, it plans to construct on this work with many extra celebrated AI voices set to launch within the coming months.
Reader provides AI voice to any digital textual content
Whereas ElevenLabs has particularly targeted on the artistic trade with AI fashions for text-to-speech and speech-to-speech conversion, dubbing and sound impact creation, the Reader app provides a extra tailor-made type to its analysis within the text-to-speech area. All a consumer has to do is give the hyperlink or file for any digital textual content – be it an article, PDF, publication or 300-page e-book – and the app immediately processes the textual content and begins the voiceover AI narration, with a inexperienced highlighter following alongside and highlighting every phrase spoken by the AI.
Countdown to VB Rework 2024
Be part of enterprise leaders in San Francisco from July 9 to 11 for our flagship AI occasion. Join with friends, discover the alternatives and challenges of Generative AI, and discover ways to combine AI purposes into your trade. Register Now
The characteristic is accessible in English, though customers can customise their expertise by selecting from 11 voices and accents, from male to feminine, American to Austrian to British English. Now, the Iconic voices launched as we speak provides to this expertise, permitting customers to find and expertise content material within the voice of the late stars.
Think about a consumer with the ability to hearken to L. Frank Baum’s The Great Wizard of Oz within the voice of late Judy Garland who acted within the cinematic adaption of the novel.
For the members of the family of the late stars, the AI-based voice recreation is a chance to make it possible for the celebs’ legacies stay on, with their present followers getting a approach to reconnect with them, and new-age customers getting a approach to uncover them. In the meantime, for ElevenLabs, the announcement is predicted to drive extra engagement on the brand new app.
“Judy Garland, James Dean, Burt Reynolds and Sir Laurence Olivier are a few of the most celebrated actors in historical past. We deeply respect their legacy and are honored to have their voices as a part of our platform,” stated Dustin Clean, head of partnerships at ElevenLabs “Including them to our rising record of narrators marks a serious step ahead in our mission of creating content material accessible in any language and voice.”
Are these AI voices secure from abuse?
One of many greatest considerations related to voice cloning expertise – just like the one at play right here – is that voice recreations of recognized personalities can painting them as saying issues they by no means really stated in the true world. Biden’s Robocall incident is the largest instance of such a problem. In the identical manner, what if a CEO’s voice is cloned to make them say issues that might doubtlessly wreck their or their firm’s status?
ElevenLabs says it understands these considerations and is shifting to broaden partnerships for the enduring voices characteristic with a selected concentrate on security.
Sam Sklar, who handles progress advertising at ElevenLabs, instructed VentureBeat that the corporate retains full management over movie star voices and makes them out there solely on the Reader app, which has been designed in such a manner that customers can solely convert digital textual content into AI narration for particular person consumption — relatively than additional sharing or downloading.
“For instance, by the Reader App, you possibly can select an article on VentureBeat and choose Judy Garland to relate it only for you. You can’t entry her voice by the ElevenLabs voice library (a separate internet product of the corporate). This implies they will’t be used along side our typical text-to-speech instruments on the platform, nor can the content material they converse by the Reader App be downloaded or shared,” he defined.
If a consumer uploads dangerous content material as textual content to report its iconic voice narration by a secondary machine, the corporate won’t even generate the AI voiceover. It has positioned automated and human moderation processes in between to determine and block hate speech and different types of textual content that violate its phrases of service.
As for the possibilities of the voice library being misused to clone celeb voices from scratch, Sklar says the platform has been constructed with a number of safeguards, together with a voice captcha verification that matches the audio samples uploaded for cloning with the voice recording of the consumer. If the voice doesn’t match after just a few makes an attempt, the cloning request will not be processed. There’s additionally a “no go” voices coverage in place, which prohibits the cloning of voices deemed excessive danger.
“Any try and clone these voices will likely be blocked,” Sklar stated.
Whereas these steps do cut back the possibilities of celebs, actors and enterprise executives’ voices being cloned, there nonetheless could be instances of violations. As an illustration, malicious customers might craft the content material for the Reader app in such a manner that it bypasses the moderation measures positioned by the corporate.
In the long term, it will likely be attention-grabbing to see how the enduring voices functionality, which has been positioned as an providing for followers and lovers, impacts the trade. The Reader app internet hosting it will likely be rolling out each globally and to Android units this summer season. Assist for extra languages can be on the way in which.
Source link