Skip to main content

ChatGPT’s highly anticipated Advanced Voice could arrive ‘next week’

screencap. two people sitting at a desk talking to OpenAI's Advanced Voice mode on a cellphone
OpenAI

OpenAI CEO and co-founder Sam Altman revealed on X (formerly Twitter) Thursday that its Advanced Voice feature will begin rolling out “next week,” though only for a few select ChatGPT-Plus subscribers.

The company plans to “start the alpha with a small group of users to gather feedback and expand based on what we learn.”

alpha rollout starts to plus subscribers next week!

— Sam Altman (@sama) July 25, 2024

Advanced Voice, which does away with the text prompt and enables users to converse directly with the AI as one would another human, was initially announced in May alongside the release of GPT-4o during the company’s Spring Update event. Unlike existing digital assistants like Siri and Google Assistant, which only provide canned answers to user queries, ChatGPT’s Advanced Voice provides human-like responses, nearly latency-free, and in multiple languages.

The GPT-4o model is able to respond to audio inputs in 320 milliseconds on average, which is on par with how quickly humans react to normal conversation. As you can see in the demo video below, the model can converse with multiple users simultaneously, improvise talking points and questions in both English and Portuguese as well as conveying them with human-ish emotions, including “laughter.”

Learning a new language with ChatGPT Advanced Voice Mode

There’s no word yet on how the company will choose participants for alpha trial aside from them being $20/month ChatGPT Plus-tier subscribers. The alpha release was originally scheduled for June, though that date was pushed back “to reach our bar to launch” and improve its ability to detect and reject prohibited forms of content, as well as buttress the company’s IT infrastructure to accommodate the anticipated user load increase.

As the company announced in June, the feature’s full rollout won’t happen until at least this fall, and its exact timing will, again, depend on it “meeting our high safety and reliability bar.”

Giving ChatGPT the ability to converse naturally with its users is a huge advancement. Eliminating the need for a context window reduce user hardware requirements and expand the potential integrations and use cases for AI (such as increasing access to users with body mobility or dexterity limitations).

It can also help speed the technology’s adoption by the public by reducing the barrier to entry for less-tech-savvy users who are comfortable with interacting with their computers via “hey Siri” but blanch at the prospect of prompt engineering.

Andrew Tarantola
Andrew has spent more than a decade reporting on emerging technologies ranging from robotics and machine learning to space…
ChatGPT: the latest news and updates on the AI chatbot that changed everything
ChatGPT app running on an iPhone.

In the ever-evolving landscape of artificial intelligence, ChatGPT stands out as a groundbreaking development that has captured global attention. From its impressive capabilities and recent advancements to the heated debates surrounding its ethical implications, ChatGPT continues to make headlines.

Whether you're a tech enthusiast or just curious about the future of AI, dive into this comprehensive guide to uncover everything you need to know about this revolutionary AI tool.
What is ChatGPT?
ChatGPT is a natural language AI chatbot. At its most basic level, that means you can ask it a question and it will generate an answer. As opposed to a simple voice assistant like Siri or Google Assistant, ChatGPT is built on what is called an LLM (Large Language Model). These neural networks are trained on huge quantities of information from the internet for deep learning -- meaning they generate altogether new responses, rather than just regurgitating canned answers. They're not built for a specific purpose like chatbots of the past -- and they're a whole lot smarter.

Read more
All the wild things people are doing with ChatGPT’s new Voice Mode
Nothing Phone 2a and ChatGPT voice mode.

ChatGPT's Advanced Voice Mode arrived on Tuesday for a select few OpenAI subscribers chosen to be part of the highly anticipated feature's alpha release.

The feature was first announced back in May. It is designed to do away with the conventional text-based context window and instead converse using natural, spoken words, delivered in a lifelike manner. It works in a variety of regional accents and languages. According to OpenAI, Advanced Voice, "offers more natural, real-time conversations, allows you to interrupt anytime, and senses and responds to your emotions."

Read more
GPT-4: everything you need to know about ChatGPT’s standard AI model
A laptop opened to the ChatGPT website.

People were in awe when ChatGPT came out, impressed by its natural language abilities as an AI chatbot originally powered by the GPT-3.5 large language model. But when the highly anticipated GPT-4 large language model came out, it blew the lid off what we thought was possible with AI, with some calling it the early glimpses of AGI (artificial general intelligence).
What is GPT-4?
GPT-4 is the newest language model created by OpenAI that can generate text that is similar to human speech. It advances the technology used by ChatGPT, which was previously based on GPT-3.5 but has since been updated. GPT is the acronym for Generative Pre-trained Transformer, a deep learning technology that uses artificial neural networks to write like a human.

According to OpenAI, this next-generation language model is more advanced than ChatGPT in three key areas: creativity, visual input, and longer context. In terms of creativity, OpenAI says GPT-4 is much better at both creating and collaborating with users on creative projects. Examples of these include music, screenplays, technical writing, and even "learning a user's writing style."

Read more