Baidu’s Deep Voice 2 text-to-speech engine can imitate hundreds of human accents

Baidu, the Beijing-based juggernaut that commands 80 percent of the Chinese internet search market, is investing heavily in artificial intelligence. In 2013, it opened the Institute of Deep Learning, an R&D center focused on machine learning. And in May, it took the wraps off the newest version of Deep Voice, its AI-powered text-to-speech engine.

Deep Voice 2, which follows on the heels of Deep Voice’s public debut earlier this year, can produce real-time speech that’s nearly indistinguishable from a human voice. All the more impressive, it needs just thirty minutes of audio to build a working model, and can imitate the regional accents of hundreds of different speakers.

That’s leaps and bounds better than early versions of Deep Voice, which took multiple hours to learn one voice.

The key is Deep Voice 2’s ability to identify similarities between hundreds of different speakers and use them to build a working model of a human voice. It then autonomously derives unique voices from that model. Unlike voice assistants such as Apple’s Siri, which require that a human record thousands of hours of speech that engineers tune by hand, Deep Voice 2 doesn’t require guidance or manual intervention.

“Give it the right data, and it can learn on [its] own what sort of features are important,” Andrew Gibiansky, a research scientist at Baidu’s Silicon Valley AI Lab, told The Verge.

Baidu isn’t the only company investing in high-quality text-to-speech tech. Google’s WaveNet, a product of the company’s DeepMind division, generates voices by sampling real human speech and independently creating its own sounds in a variety of voices. Adobe’s Project VoCo transcribes human speech to editable text in real time. And Lyrebird, a Canadian AI startup, licenses algorithms that can imitate any voice with just a single minute of sample audio, create one thousand sentences in less than half a second, and infuse the speech they create with emotions like anger, sympathy, and stress.

But don’t expect Deep Voice 2 or WaveNet to replace Siri, the Google Assistant, or Amazon’s Alexa anytime soon, because AI-powered translation apps require more resources than today’s phones can reasonably supply. Still, Baidu sees potential in applications like text-to-speech apps and voice-based assistants. “The ability to quickly synthesize multiple human voices will have a huge effect on products such as personal assistants and eBook readers in the future. For example, each character of your eBook could have a unique voice when you listen to the eBook.”

Kyle Wiggers
Former Digital Trends Contributor