Skip to main content

Baidu’s SwiftScribe uses AI to transcribe audio files up to an hour in length

baidu
Image used with permission by copyright holder
Baidu may be known as “the Google of China,” but that doesn’t mean the Asian search giant doesn’t come up with its own unique applications. On Monday, it debuted SwiftScribe, a web app that automatically transcribes speech files with the help of artificial intelligence.

SwiftScribe is about as simple as web apps come. It recognizes files in .wav and .mp3 format, and once the upload’s complete, the transcription process gets underway. A 30-second file takes about 10 seconds, and a one-minute file less than 30. An hour of audio, the maximum length SwiftScribe will allow, takes 20 minutes.

Recommended Videos

It’s not always perfect. SwiftScribe sometimes misses the spelling of certain words, and capitalization and punctuation aren’t always on point. But it offers an editable field that lets users correct mistakes, and a built-in speed-shifting tool that plays the uploaded audio clip audio at a faster or slower speed.

Please enable Javascript to view this content

Baidu project manager Tian Wu, who was inspired partly by her experience transcribing interviews as a graduate student at the University of California, Santa Barbara, said that SwiftScribe has the potential to save hours. “English is not my first language,” Wu told VentureBeat. “It took 10 hours to transcribe one hour of audio. That’s my personal experience. Usually, it will take a professional four to six hours to transcribe a one-hour audio clip.”

Image used with permission by copyright holder

Wu told VentureBeat that SwiftScribe can help transcribe audio 1.67 times faster on average. She envisions transcriptionists doing more work and ultimately getting paid more for it.

SwiftScribe’s more proof of concept than polished product, right now. In the coming months, the team plans to enhance the app with video transcription and captioning, support for more file formats, and an option for automatically adding punctuation.

It’s free to use for now, but Baidu’s considering a paid option. “In the future, we hope to turn it into a business,” Wu said.

Baidu may not have the name recognition in the United States that it does in mainland China, where the Beijing-based juggernaut commands roughly 80 percent of the internet search market and amasses quarterly profits that regularly top the hundreds of millions. But it’s hoping to change that. In 2013, it opened the Institute of Deep Learning, a research center devoted to advancing the firm’s artificial intelligence efforts.

In the immediate future, the Chinese aims to use the lab to increase revenue by building augmented reality marketing tools. But it may be considering a significant expansion of health-care and education applications.

Kyle Wiggers
Former Digital Trends Contributor
Kyle Wiggers is a writer, Web designer, and podcaster with an acute interest in all things tech. When not reviewing gadgets…
Intel’s promised Arrow Lake autopsy details up to 30% loss in performance
The Core Ultra 9 285K socketed into a motherboard.

Intel's Arrow Lake CPUs didn't make it on our list of the best processors when they released earlier this year. As you can read in our Core Ultra 9 285K review, Intel's latest desktop offering struggled to keep pace with last-gen options, particularly in games, and showed strange behavior in apps like Premiere Pro. Now, Intel says it has fixed the issues with its Arrow Lake range, which accounted for up to a 30% loss in real-world performance compared to Intel's in-house testing.

The company identified five issues with the performance of Arrow Lake, four of which are resolved now. The latest BIOS and Windows Updates (more details on those later in this story) will restore Arrow Lake processors to their expected level of performance, according to Intel, while a new firmware will offer additional performance improvements. That firmware is expected to release in January, pushing beyond the baseline level of performance Intel expected out of Arrow Lake.

Read more
You can get this 40-inch LG UltraWide 5K monitor at $560 off if you hurry
A woman using the LG UltraWide 40WP95C-W 5K monitor.

If you need a screen to go with the upgrade that you made with desktop computer deals, and you're willing to spend for a top-of-the-line display, then you may want to set your sights on the LG 40WP95C-W UltraWide curved 5K monitor. From its original price of $1,800, you can get it for $1,240 from Walmart for huge savings of $560, or for $1,275 from Amazon for a $525 discount. You should complete your purchase quickly if you're interested though, as there's no telling when the offers for this monitor will expire.

Why you should buy the LG 40WP95C-W UltraWide curved 5K monitor
5K monitors are highly recommended for serious creative professionals, such as graphic designers and filmmakers, for their extremely sharp details and precise colors, and the LG 40WP95C-W UltraWide curved 5K monitor is an excellent choice. We've tagged it as the best ultrawide 5K monitor in our roundup of the best 5K monitors, with its huge 40-inch curved screen featuring 5120 x 2160 resolution, 98% coverage of the DCI-P3 spectrum, and support for HDR10 providing striking visuals that you won't enjoy from most of the other options in the market.

Read more
Generative-AI-powered video editing is coming to Instagram
Instagram on iPhone against a colorful background.

Editing your Instagram videos will soon be as simple as typing out a text prompt, thanks to a new generative AI tool the company hopes to release in 2025, CEO Adam Mosseri announced Thursday.

The upcoming tool, which leverages Meta's Movie Gen model, will enable users to "change nearly any aspect of your videos," Mosseri said during his preview demonstration. Those changes range from subtle modifications, like adding a gold chain to his existing outfit or a hippo in the background, to wholesale alterations including swapping his wardrobe or giving himself a felt, Muppet-like appearance.

Read more