OpenAI launches new voice intelligence features in its API

OpenAI said Thursday that its API will now include a number of new voice intelligence features designed to help developers create apps that can talk, transcribe, and translate conversations with users.

The company’s new GPT‑Realtime‑2 is another voice model, built to create a realistic vocal simulation that can converse with users. However, unlike its predecessor (GPT-Realtime-1.5) this one is built with GPT‑5‑class reasoning that OpenAI says was created to deal with more complicated requests from users.

The company is also launching GPT‑Realtime‑Translate, which, just as it sounds, is designed to provide real-time translation services that “keep pace” with the user, conversationally. The feature includes more than 70 input languages (that is, the languages that it can comprehend) and 13 output languages (the languages it relays to the speaker).

Finally, the company has also launched a new transcription capability, GPT-Realtime-Whisper, which gives users live speech-to-text capabilities that are captured as interactions occur.

“Together, the models we are launching move real-time audio from simple call-and-response toward voice interfaces that can actually do work: listen, reason, translate, transcribe, and take action as a conversation unfolds,” the company said.

Who will these updates be good for? Companies that want to expand customer service capabilities are an obvious target. However, OpenAI also notes that its new features will assist with a wide array of areas, including education, media, events, and creator platforms, among others.

As useful as these tools seem from an enterprise perspective, it also seems plausible that they could be misused. The company said it has built guardrails to stop its new features from being abused to create spam, fraud, or other forms of online abuse. Certain triggers have been embedded in the system so that “conversations can be halted if they are detected as violating our harmful content guidelines,” OpenAI said.

This Week Only: Buy one pass, get the second at 50% off

All of the new voice models are included in OpenAI’s Realtime API . Translate and Whisper are billed by the minute, while GPT-Realtime-2 is billed by token consumption.

When you purchase through links in our articles, we may earn a small commission . This doesn’t affect our editorial independence.

StrictlyVC Athens is up next. Hear unfiltered insights straight from Europe’s tech leaders and connect with the people shaping what’s ahead. Lock in your spot before it’s gone.

Hackers deface school login pages after claiming another Instructure hack Lorenzo Franceschi-Bicchierai Zack Whittaker

Hackers deface school login pages after claiming another Instructure hack

Hackers steal students’ data during breach at education tech giant Instructure Lorenzo Franceschi-Bicchierai

Hackers steal students’ data during breach at education tech giant Instructure

As workers worry about AI, Nvidia’s Jensen Huang says AI is ‘creating an enormous number of jobs’ Lucas Ropek

As workers worry about AI, Nvidia’s Jensen Huang says AI is ‘creating an enormous number of jobs’

Anthropic and OpenAI are both launching joint ventures for enterprise AI services Russell Brandom

Anthropic and OpenAI are both launching joint ventures for enterprise AI services

Ouster’s new color lidar is coming to replace cameras Sean O'Kane

Ouster’s new color lidar is coming to replace cameras

This tiny, magnetic e-reader could stop you from doomscrolling Amanda Silberling

This tiny, magnetic e-reader could stop you from doomscrolling

Uber wants to turn its millions of drivers into a sensor grid for self-driving companies Connie Loizos

Uber wants to turn its millions of drivers into a sensor grid for self-driving companies

Key takeaways

OpenAI's new voice features could revolutionize customer service in Brazil by providing more personalized experiences.

The ability to translate and transcribe in real-time can promote inclusion and democratization of knowledge in educational settings.

Responsible adoption of these technologies is essential to prevent abuse and ensure beneficial use in the Brazilian context.

Editorial analysis

The introduction of OpenAI's new voice intelligence features has the potential to transform how Brazilian companies interact with their customers. In the context of customer service, these tools can not only improve efficiency but also provide a more personalized and engaging experience. With the growing demand for automated solutions, especially in a market that seeks constant innovation, integrating these technologies can be a significant competitive advantage for local businesses.

Moreover, the applications in education and creator platforms are particularly relevant for Brazil, where linguistic and cultural diversity is a significant factor. The ability to translate and transcribe in real-time can facilitate access to information and communication in educational settings, promoting inclusion and democratization of knowledge. Educational institutions that adopt these technologies could offer more interactive and accessible learning experiences.

However, it is crucial to observe the ethical implications and challenges associated with using these tools. OpenAI mentions implementing safeguards to prevent abuse, but the effectiveness of these measures still needs to be evaluated. In Brazil, where misinformation and online fraud are growing concerns, the responsible adoption of these technologies will be essential to ensure they are used beneficially rather than harmfully. The future of voice intelligence will depend not only on technological innovation but also on building an ecosystem that prioritizes ethics and social responsibility.

Finally, Brazilian companies should stay alert to the ongoing development of these technologies. The evolution of AI capabilities can open new market opportunities but will also require constant adaptation of business strategies. Integrating solutions like those from OpenAI could be an important step for companies looking to stand out in an increasingly digital and competitive environment.

OpenAI launches new voice intelligence features in its API

Key takeaways

Editorial analysis

What this coverage includes

About this article