Skip to main content

Deepfakes for voice are here, and that’s good news — for now

LOVO side-by-side comparison

Everyone reading this will likely be familiar with deepfakes, the sometimes humorous and oftentimes scary technology that makes it possible to alter an existing image or video by digitally replacing a person’s likeness. But while deepfakes are well known, the audio equivalent — capable of simulating the voice of a real person — hasn’t received quite the same level of coverage.

Recommended Videos

Nonetheless, the technology to do this is out there, and getting better all the time. The latest demonstration of this tech in action comes from the startup LOVO Studio. The company has developed a new tool it claims can re-create accurate (and recognizable) human voices, complete with emotion and tone gradations to add to the realism. While the results aren’t perfect across the board, they can be eerily accurate at times.

“We take a recording data of your voice, run it through our proprietary machine learning models that learn your voice’s tone, pitch, speed, accents, habits, and other bits that truly make your voice unique, and create a clone of your voice which actually can figure out how you would speak or pronounce certain words even if they weren’t included in the original data that was fed in,” Tom Lee, co-founder of LOVO, told Digital Trends.

In a demonstration of its tech, seen above, LOVO re-creates the voices of famous public figures such as the late South African president Nelson Mandela. While this could potentially be used for damaging purposes (imagine combining these tools with deepfake videos to fabricate politicians saying things they never actually said), LOVO has its eye on useful applications.

“Imagine a radio ad on Spotify that calls out each user by their name,” Lee said. “[That would be] one million variations of the same ad, created with a few clicks, no extra recording necessary. Imagine teachers and corporate lecturers cloning their voices and turning their courses to audio files without having to record each new session. Imagine preserving the voice of your loved one and making your smart speaker talk to you in that voice. The possibilities are simply endless.”

The company’s LOVO Studio platform will be aimed at any application which requires synthetic voices. That could be marketing videos, e-learning materials, audiobook publishing, gaming companies, smart speaker companies, and more. It features a number of synthesized voices that can be used to create the perfect voiceover for any purpose, complete with control over things like tone, pronunciation, speed, and other elements.

Luke Dormehl
Former Digital Trends Contributor
I'm a UK-based tech writer covering Cool Tech at Digital Trends. I've also written for Fast Company, Wired, the Guardian…
We have some good news about AMD’s next massive CPU launch
The Ryzen 5 7600X sitting among thermal paste and RAM.

For a while, AMD's highly anticipated new CPUs seemed like they had slipped to 2025. That would have been bad news for AMD, but according to some new reporting on the topic, Zen 5 chips may be getting announced sooner than we thought.

As reported by Paul Alcorn of Tom's Hardware, an AMD rep confirmed that the Zen 5 lineup is on track for a 2024 release, likely in the second half of the year. The information was shared during an AMD earnings call for the final quarter of 2023.

Read more
The ROG Ally 2 is coming. Here’s why that’s a great sign
Starfield running on the Asus ROG Ally.

It's no surprise that Asus is already working on the next generation of its gaming handheld, the ROG Ally -- but I never expected it to be this fast. According to Asus, the next-gen ROG Ally is likely to be released this year. That might feel a little rushed, but it's great sign for the future of handhelds.

Oftentimes, when news like this comes out, it's a rumor or a leak that you can't be sure about. However, this time, the information comes straight from the source. Asus India's vice president for consumer and gaming PC, Arnold Su, told Techlusive in an interview that the second-gen handheld is "most likely" coming this year.

Read more
Spotify using AI to clone and translate podcasters’ voices
spotify app available in windows 10 store

Spotify has unveiled a remarkable new feature powered by artificial intelligence (AI) that translates a podcast into multiple languages using the same voices of those in the show.

It’s been made possible partly by OpenAI’s just-released voice generation technology that needs only a few seconds of listening to replicate a voice.

Read more