Skip to main content

Digital Trends may earn a commission when you buy through links on our site. Why trust us?

The new AI tool that was deemed ‘too dangerous’ to release

Back in 2019, OpenAI refused to release its full research into the development of GPT2 over fears that it was “too dangerous” to release publicly. On Thursday, OpenAI’s biggest financial backer, Microsoft, made a similar pronouncement about its new VALL-E 2 voice synthesizer AI.

The VALL-E 2 system is a zero-shot text-to-speech synthesis (TTS) AI, meaning that it can recreate hyper-realistic speech based on just a few seconds of sample audio. Per the research team, VALL-E 2 “surpasses previous systems in speech robustness, naturalness, and speaker similarity. It is the first of its kind to reach human parity on these benchmarks.”

Recommended Videos

The system reportedly can even handle sentences that are difficult to pronounce because of their structural complexity or repetitive phrasing, such as tongue twisters.

There are a host of potential beneficial uses for such a system, like enabling people suffering from aphasia or Amyotrophic lateral sclerosis (commonly known as ALS or Lou Gehrig’s disease) to speak again, albeit through a computer, as well as use in education, entertainment, journalism, chatbots and translation, or as accessibility features and “interactive voice response systems,” like Siri. However, the team also recognizes numerous opportunities for the public to misuse its technology, “such as spoofing voice identification or impersonating a specific speaker.”

As such the AI will only be available for research purposes. “Currently, we have no plans to incorporate VALL-E 2 into a product or expand access to the public,” the team wrote. ” If you suspect that VALL-E 2 is being used in a manner that is abusive or illegal or infringes on your rights or the rights of other people, you can report it at the Report Abuse Portal.”

Microsoft is hardly alone in its efforts to train computers to speak as humans do. Google’s Chirp, ElevenLabs’ Iconic Voices, and Voicebox from Meta all aim to perform similar functions.

However, such systems have come under ethical scrutiny as they have repeatedly been used to scam unsuspecting victims by emulating the voice of a loved one or a well-known celebrity. And unlike generated images, there’s currently no way to effectively “watermark” AI generated audio.

Andrew Tarantola
Andrew Tarantola is a journalist with more than a decade reporting on emerging technologies ranging from robotics and machine…
Google’s AI detection tool is now available for anyone to try
Gemini running on the Google Pixel 9 Pro Fold.

Google announced via a post on X (formerly Twitter) on Wednesday that SynthID is now available to anybody who wants to try it. The authentication system for AI-generated content embeds imperceptible watermarks into generated images, video, and text, enabling users to verify whether a piece of content was made by humans or machines.

“We’re open-sourcing our SynthID Text watermarking tool,” the company wrote. “Available freely to developers and businesses, it will help them identify their AI-generated content.”

Read more
Radiohead’s Thom Yorke among thousands of artists who issue AI protest
Thom Yorke on stage.

Leading actors, authors, musicians, and novelists are among 11,500 artists to have put their name to a statement calling for a halt to the unlicensed use of creative works to train generative AI tools like OpenAI’s ChatGPT, describing it as a “threat” to the livelihoods of creators.

The open letter, comprising just 29 words, says: “The unlicensed use of creative works for training generative AI is a major, unjust threat to the livelihoods of the people behind those works, and must not be permitted.”

Read more
The best AI chatbots to try: ChatGPT, Gemini, and more
Bing Chat shown on a laptop.

The idea of chatbots has been around since the early days of the internet. But even compared to popular voice assistants like Siri, the generated chatbots of the modern era are far more powerful.

Yes, you can converse with them in natural language. But these AI chatbots can generate text of all kinds, from poetry to code, and the results really are exciting. ChatGPT remains in the spotlight, but as interest continues to grow, more rivals are popping up to challenge it.
OpenAI ChatGPT and ChatGPT Plus

Read more