OpenAI launches new voice intelligence features in its API
The new features could be handy for customer service systems, but OpenAI says they have applications that work across a variety of other fields, including education and creator platforms.
OpenAI said Thursday that its API will now include a number of new voice intelligence features designed to help developers create apps that can talk, transcribe, and translate conversations with users.
The company’s new GPT‑Realtime‑2 is another voice model, built to create a realistic vocal simulation that can converse with users. However, unlike its predecessor (GPT-Realtime-1.5) this one is built with GPT‑5‑class reasoning that OpenAI says was created to deal with more complicated requests from users.
The company is also launching GPT‑Realtime‑Translate, which, just as it sounds, is designed to provide real-time translation services that “keep pace” with the user, conversationally. The feature includes more than 70 input languages (that is, the languages that it can comprehend) and 13 output languages (the languages it relays to the speaker).
Finally, the company has also launched a new transcription capability, GPT-Realtime-Whisper, which gives users live speech-to-text capabilities that are captured as interactions occur.
“Together, the models we are launching move real-time audio from simple call-and-response toward voice interfaces that can actually do work: listen, reason, translate, transcribe, and take action as a conversation unfolds,” the company said.
Who will these updates be good for? Companies that want to expand customer service capabilities are an obvious target. However, OpenAI also notes that its new features will assist with a wide array of areas, including education, media, events, and creator platforms, among others.
As useful as these tools seem from an enterprise perspective, it also seems plausible that they could be misused. The company said it has built guardrails to stop its new features from being abused to create spam, fraud, or other forms of online abuse. Certain triggers have been embedded in the system so that “conversations can be halted if they are detected as violating our harmful content guidelines,” OpenAI said.
This Week Only: Buy one pass, get the second at 50% off
This Week Only: Buy one pass, get the second at 50% off
All of the new voice models are included in OpenAI’s Realtime API . Translate and Whisper are billed by the minute, while GPT-Realtime-2 is billed by token consumption.
When you purchase through links in our articles, we may earn a small commission . This doesn’t affect our editorial independence.
StrictlyVC Athens is up next. Hear unfiltered insights straight from Europe’s tech leaders and connect with the people shaping what’s ahead. Lock in your spot before it’s gone.
Hackers deface school login pages after claiming another Instructure hack Lorenzo Franceschi-Bicchierai Zack Whittaker
Hackers deface school login pages after claiming another Instructure hack
Hackers deface school login pages after claiming another Instructure hack
Hackers steal students’ data during breach at education tech giant Instructure Lorenzo Franceschi-Bicchierai
Hackers steal students’ data during breach at education tech giant Instructure
Hackers steal students’ data during breach at education tech giant Instructure
As workers worry about AI, Nvidia’s Jensen Huang says AI is ‘creating an enormous number of jobs’ Lucas Ropek
As workers worry about AI, Nvidia’s Jensen Huang says AI is ‘creating an enormous number of jobs’
As workers worry about AI, Nvidia’s Jensen Huang says AI is ‘creating an enormous number of jobs’
Anthropic and OpenAI are both launching joint ventures for enterprise AI services Russell Brandom
Anthropic and OpenAI are both launching joint ventures for enterprise AI services
Anthropic and OpenAI are both launching joint ventures for enterprise AI services
Ouster’s new color lidar is coming to replace cameras Sean O'Kane
Ouster’s new color lidar is coming to replace cameras
Ouster’s new color lidar is coming to replace cameras
This tiny, magnetic e-reader could stop you from doomscrolling Amanda Silberling
This tiny, magnetic e-reader could stop you from doomscrolling
This tiny, magnetic e-reader could stop you from doomscrolling
Uber wants to turn its millions of drivers into a sensor grid for self-driving companies Connie Loizos
Uber wants to turn its millions of drivers into a sensor grid for self-driving companies
Uber wants to turn its millions of drivers into a sensor grid for self-driving companies
Key takeaways
- OpenAI's new voice features could revolutionize customer service in Brazil by providing more personalized experiences.
- The ability to translate and transcribe in real-time can promote inclusion and democratization of knowledge in educational settings.
- Responsible adoption of these technologies is essential to prevent abuse and ensure beneficial use in the Brazilian context.
Editorial analysis
The introduction of OpenAI's new voice intelligence features has the potential to transform how Brazilian companies interact with their customers. In the context of customer service, these tools can not only improve efficiency but also provide a more personalized and engaging experience. With the growing demand for automated solutions, especially in a market that seeks constant innovation, integrating these technologies can be a significant competitive advantage for local businesses.
Moreover, the applications in education and creator platforms are particularly relevant for Brazil, where linguistic and cultural diversity is a significant factor. The ability to translate and transcribe in real-time can facilitate access to information and communication in educational settings, promoting inclusion and democratization of knowledge. Educational institutions that adopt these technologies could offer more interactive and accessible learning experiences.
However, it is crucial to observe the ethical implications and challenges associated with using these tools. OpenAI mentions implementing safeguards to prevent abuse, but the effectiveness of these measures still needs to be evaluated. In Brazil, where misinformation and online fraud are growing concerns, the responsible adoption of these technologies will be essential to ensure they are used beneficially rather than harmfully. The future of voice intelligence will depend not only on technological innovation but also on building an ecosystem that prioritizes ethics and social responsibility.
Finally, Brazilian companies should stay alert to the ongoing development of these technologies. The evolution of AI capabilities can open new market opportunities but will also require constant adaptation of business strategies. Integrating solutions like those from OpenAI could be an important step for companies looking to stand out in an increasingly digital and competitive environment.
What this coverage includes
- Clear source attribution and link to the original publication.
- Editorial framing about relevance, impact, and likely next developments.
- Review for readability, context, and duplication before publication.
Original source:
TechCrunch AIAbout this article
This article was curated and published by AIDaily as part of our editorial coverage of artificial intelligence developments. The content is based on the original source cited below, enriched with editorial context and analysis. Automated tools may assist with translation and initial structuring, but publication decisions, factual review, and contextual framing remain editorial responsibilities.
Learn more about our editorial process