speech recognition

Amazon debuts automatic speech recognition service, Amazon Transcribe Medical

Amazon is expanding its automatic transcription service for AWS, Amazon Transcribe, to include support for medical speech, the company announced this morning at its AWS re:Invent conference. The new

Google will help you pronounce difficult words

Google wants to make it easier to learn word pronunciations. Today, it introduced a new Search feature that will let users practice saying tricky words. When you look up a pronunciation, Google will

Google Search now helps you pronounce ‘quokka’

Google is adding a nifty new feature to its search results when you look for the pronunciation of words. You’ll now be able to not just hear the correct pronunciation, but you can also now practice

Facebook details wav2vec, an AI algorithm that uses raw audio to improve speech recognition

Facebook detailed wav2vec, a novel AI system that leverages raw audio to substantially improve speech recognition accuracy.

Amazon’s AI reduces real-time speech recognition error rate by 6.2%

Researchers at Amazon describe a battery of techniques for incorporating machine learning models into speech recognizers.

Google open-sources Live Transcribe’s speech engine

Google has open-sourced on GitHub the speech engine that powers Live Transcribe, its Android speech recognition and transcription tool.

Google details AI work behind Project Euphonia’s more inclusive speech recognition

As part of new efforts towards accessibility, Google announced Project Euphonia at I/O in May: An attempt to make speech recognition capable of understanding people with non-standard speaking voices

Nvidia Unveils Conversational AI Tech for Smarter Bots

Going beyond request-response speech recognition to conversational AI requires solving some challenging performance problems. Today Nvidia released code that improves on the current state of the art

Dasha AI is calling so you don’t have to

While you’d be hard-pressed to find any startup not brimming with confidence over the disruptive idea they’re chasing, it’s not often you come across a young company as calmly convinced it’s

Google updates its speech tech for contact centers

Last July, Google announced its Contact Center AI product for helping businesses get more value out of their contact centers. Contact Center AI uses a mix of Google’s machine learning-powered tools to

Google debuts better transcription, endless streaming, and more in Contact Center AI

Google bolstered its nascent Contact Center AI service with a raft of features that vastly improve speech recognition accuracy.

Adobe brings Alexa integration to its XD prototyping tool

Adobe XD, the company’s increasingly popular prototyping and design tool, is getting support for testing Amazon Alexa voice experiences on devices like the Echo Dot and the Echo Show. This work builds

Apple’s Voice Control improves accessibility OS-wide on all its devices

Apple is known for fluid, intuitive user interfaces, but none of that matters if you can't click, tap, or drag because you don't have a finger to do so with. For users with disabilities the company is

Microsoft’s AI generates realistic speech with only 200 training samples

In a newly published paper, Microsoft researchers describe a state-of-the-art AI speech system that needs only 200 samples to achieve high accuracy.

Google’s Live Transcribe is getting sound events and transcription saving

Google is updating Live Transcribe with two new features, sound events and transcription saving, both coming to the speech recognition tool next month.

Alexa speech normalization AI reduces errors by up to 81%

In a newly published paper, Amazon scientists describe a machine learning text normalization system that reduces errors by up to 81%.

XPRIZE names two grand prize winners in $15 million Global Learning Challenge

XPRIZE, the non-profit organization developing and managing competitions to find solutions to social challenges, has named two grand prize winners in the Elon Musk-backed Global Learning XPRIZE. The

IBM’s AI performs state-of-the-art broadcast news captioning

Researchers at IBM say they've devised an AI system that achieves state-of-the-art results on broadcast captioning tasks.

Amazon Alexa scientists retrain an English-language AI model on Japanese

In a new paper, scientists at Amazon's Alexa division describe a transfer learning technique that yields an improvement.

ProBeat: Has Google’s word error rate progress stalled?

If Google hit a wall in 2017 with cloud-powered speech recognition, it makes sense to shift resources to improving offline, on-device speech recognition.