Saturday, 13 Dec 2025
Subscribe
logo
  • Global
  • AI
  • Cloud Computing
  • Edge Computing
  • Security
  • Investment
  • Sustainability
  • More
    • Colocation
    • Quantum Computing
    • Regulation & Policy
    • Infrastructure
    • Power & Cooling
    • Design
    • Innovations
    • Blog
Font ResizerAa
Data Center NewsData Center News
Search
  • Global
  • AI
  • Cloud Computing
  • Edge Computing
  • Security
  • Investment
  • Sustainability
  • More
    • Colocation
    • Quantum Computing
    • Regulation & Policy
    • Infrastructure
    • Power & Cooling
    • Design
    • Innovations
    • Blog
Have an existing account? Sign In
Follow US
© 2022 Foxiz News Network. Ruby Design Company. All Rights Reserved.
Data Center News > Blog > AI > OpenAI expands Realtime API with new voices and cuts prices for developers
AI

OpenAI expands Realtime API with new voices and cuts prices for developers

Last updated: October 31, 2024 12:37 am
Published October 31, 2024
Share
OpenAI expands Realtime API with new voices and cuts prices for developers
SHARE

Be part of our each day and weekly newsletters for the newest updates and unique content material on industry-leading AI protection. Be taught Extra


OpenAI up to date its Realtime API as we speak, which is at the moment in beta. This replace provides new voices for speech-to-speech functions to its platform and cuts prices related to caching prompts. 

Beta customers of the Realtime API will now have 5 new voices they’ll use to construct their functions. OpenAI showcased three of the brand new voices, Ash, Verse and the British-sounding Ballad, in a put up on X. 

Two Realtime API updates:

– Now you can construct speech-to-speech experiences with 5 new voices—that are rather more expressive and steerable. ???

– We’re decreasing the worth through the use of immediate caching. Cached textual content inputs are discounted 50% and cached audio inputs are discounted… pic.twitter.com/jLzZDBrR7l

— OpenAI Builders (@OpenAIDevs) October 30, 2024

The corporate stated in its API documentation that the native speech-to-speech function “skip[s] an intermediate textual content format means low latency and nuanced output,” whereas the voices are simpler to steer and extra expressive than its earlier voices. 

Nevertheless, OpenAI warns it can not supply client-side authentication for the API now because it’s nonetheless in beta. It additionally stated that there could also be points with processing real-time audio. 

“Community situations closely have an effect on real-time audio, and delivering audio reliably from a shopper to a server at scale is difficult when community situations are unpredictable,” the corporate shared.

OpenAI’s historical past with AI-powered speech and voices has been controversial. In March, it launched Voice Engine, a voice cloning platform to rival ElevenLabs, however it restricted entry to only some researchers. In Could, after the corporate demoed its GPT-4o and Voice Mode, it paused utilizing one of many voices, Sky, after the actress Scarlett Johansson spoke out about its similarity to her voice. 

See also  Anthropic vs. OpenAI red teaming methods reveal different security priorities for enterprise AI

The corporate rolled out ChatGPT Superior Voice Mode for paying subscribers (these utilizing ChatGPT Plus, Enterprise, Groups and Edu) within the U.S. in September. 

Speech-to-speech AI would ideally let enterprises construct extra real-time responses utilizing a voice. Suppose a buyer calls an organization’s customer support platform. In that case, the speech-to-speech functionality can take the particular person’s voice, perceive what they’re asking, and reply utilizing an AI-generated voice with decrease latency. Speech-to-speech additionally lets customers generate voice-overs, with a consumer talking their traces, however the voice output just isn’t theirs. One platform that gives that is Replica and, in fact, ElevenLabs.  

OpenAI launched the Realtime API this month throughout its Dev Day. The API goals to hurry up the constructing of voice assistants.

Decreasing prices

Utilizing speech-to-speech options, although, might get costly. 

When Realtime API launched, the pricing construction was at $0.06 per minute of audio enter and $0.24 per audio output, which isn’t low-cost. Nevertheless, the corporate plans to decrease real-time API costs with immediate caching. 

Cached textual content inputs will drop by 50%, and cached audio inputs will probably be discounted by 80%.

OpenAI additionally introduced Immediate Caching throughout Dev Day and would hold continuously requested contexts and prompts within the mannequin’s reminiscence. This may drop the variety of tokens it must create to generate responses. Decreasing enter costs, might encourage extra builders to hook up with the API. 

OpenAI just isn’t the one firm to roll out Immediate Caching. Anthropic launched immediate caching for Claude 3.5 Sonnet in August. 


Source link
TAGGED: API, Cuts, developers, expands, OpenAI, Prices, realtime, voices
Share This Article
Twitter Email Copy Link Print
Previous Article Itai Schwartz, Co-Founder and CTO, Eran Barak, Co-Founder and CEO and Hod Bin Noon, Co-Founder and VP of R&D. Credit- Ohad Kab JPEG.. (1) MIND Raises $11M in Funding
Next Article Gcore unveils data centre in Incheon, South Korea Vertiv appoints new Executive VP to for Global Portfolio & Business Units
Leave a comment

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Your Trusted Source for Accurate and Timely Updates!

Our commitment to accuracy, impartiality, and delivering breaking news as it happens has earned us the trust of a vast audience. Stay ahead with real-time updates on the latest events, trends.
FacebookLike
TwitterFollow
InstagramFollow
YoutubeSubscribe
LinkedInFollow
MediumFollow
- Advertisement -
Ad image

Popular Posts

Data Center Efficiency Will Overcome AI-Fueled Build-Out Challenges | DCN

If we had a conversation at this time last year, just as OpenAI released ChatGPT…

January 25, 2024

From dot-com to dot-AI: How we can learn from the last tech transformation (and avoid making the same mistakes)

Be part of our every day and weekly newsletters for the newest updates and unique…

May 18, 2025

UniUni Closes US$50M Series C Funding

CEO Peter Lu, Uniuni (CNW Group/UniUni) UniUni, a Richmond, BC, Canada-based firm which focuses on…

April 21, 2024

OpenAI's GPT-5.2 is here: what enterprises need to know

The rumors had been true: OpenAI on Thursday introduced the discharge of its new frontier…

December 12, 2025

A quantum neural network can see optical illusions like humans do. Could it be the future of AI?

(a) Sketch of the QT-DNN construction. W(n) with n = 1 … 4 are the matrices of…

August 31, 2024

You Might Also Like

Google’s new framework helps AI agents spend their compute and tool budget more wisely
AI

Google’s new framework helps AI agents spend their compute and tool budget more wisely

By saad
BBVA embeds AI into banking workflows using ChatGPT Enterprise
AI

BBVA embeds AI into banking workflows using ChatGPT Enterprise

By saad
Ai2's new Olmo 3.1 extends reinforcement learning training for stronger reasoning benchmarks
AI

Ai2's new Olmo 3.1 extends reinforcement learning training for stronger reasoning benchmarks

By saad
Experimental AI concludes as autonomous systems rise
AI

Experimental AI concludes as autonomous systems rise

By saad
Data Center News
Facebook Twitter Youtube Instagram Linkedin

About US

Data Center News: Stay informed on the pulse of data centers. Latest updates, tech trends, and industry insights—all in one place. Elevate your data infrastructure knowledge.

Top Categories
  • Global Market
  • Infrastructure
  • Innovations
  • Investments
Usefull Links
  • Home
  • Contact
  • Privacy Policy
  • Terms & Conditions

© 2024 – datacenternews.tech – All rights reserved

Welcome Back!

Sign in to your account

Lost your password?
We use cookies to ensure that we give you the best experience on our website. If you continue to use this site we will assume that you are happy with it.
You can revoke your consent any time using the Revoke consent button.