Microsoft hits another milestone in speech-recognition software accuracy

If you’re fed up with chatbots mishearing you, Microsoft is making machine ears a little more attentive. Researchers from the tech giant have achieved an impressively low error rate for speech-recognition software — just 6.3 percent, according to a paper published last week. The company hopes this milestone will help refine and personalize its AI assistant, Cortana, and features like Skype Translator.

The latest error rate of Microsoft’s conversational speech-recognition system is the lowest in the industry, according to Xuedong Huang, Microsoft’s chief speech scientist. IBM, meanwhile, recently announced an error rate of 6.6 percent, improving on its 6.9 percent error rate from April and the 8 percent milestone the company achieved last year. Two decades ago, the lowest error rate of a published system was more than 43 percent, Microsoft notes in a blog post.

In artificial intelligence development, researchers often model machines on humans by equipping the systems with the abilities to speak, see, and hear. Although Microsoft’s error rate is just 0.3 percentage points below IBM’s, incremental advancements like these bring machines closer to human-like capabilities. In speech recognition, the human error rate is around 4 percent, according to IBM.
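
For readers curious what an error rate like this measures, here is a minimal sketch. It assumes the figures refer to word error rate, the metric commonly used in speech-recognition benchmarks; the article itself does not say how Microsoft or IBM scored their systems. The function name and the sample sentences are illustrative only. The idea is to count the fewest word substitutions, insertions, and deletions needed to turn the system's transcript into the correct one, then divide by the number of words in the correct transcript.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level edit distance divided by the number of reference words.

    Illustrative only: assumes simple whitespace tokenization and
    case-sensitive matching, which real benchmarks usually refine.
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words
    # processed so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]  # i deletions turn i reference words into nothing
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(min(
                prev[j] + 1,         # reference word missing (deletion)
                curr[j - 1] + 1,     # extra hypothesis word (insertion)
                prev[j - 1] + cost,  # match or substitution
            ))
        prev = curr
    return prev[-1] / len(ref)


# One wrong word out of five gives a 20 percent error rate.
print(word_error_rate("turn on the kitchen lights",
                      "turn on the chicken lights"))
```

By this measure, a 6.3 percent rate would mean roughly six words in every hundred are wrong, missing, or added.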

“This new milestone benefited from a wide range of new technologies developed by the AI community from many different organizations over the past 20 years,” Microsoft’s Huang said.

These technologies include biologically inspired systems called neural networks, a training technique known as deep learning, and the adoption of graphics processing units (GPUs) to run the algorithms. Over the past two years, neural networks and deep learning have enabled AI researchers to develop and train systems for advanced speech recognition, image recognition, and natural language processing. Just last year, Microsoft created image-recognition software that outperformed humans.

Although initially designed for computer graphics, GPUs are now regularly used to run sophisticated algorithms. Cortana can process up to 10 times more data using GPUs than it could with previous methods, according to Microsoft.

With steady advances like these, repeating your question to a chatbot may be a thing of the past.

Dyllan Furness
Former Digital Trends Contributor
Dyllan Furness is a freelance writer from Florida. He covers strange science and emerging tech for Digital Trends, focusing…