Patent · US Active

Predicting pronunciation in speech recognition

US10339920B2 · kind B2 · utility

3Cited by

7References

21Claims

0Family size

Assignee

AMAZON TECHNOLOGIES, INC. · US

Inventors

Jeffrey Penrod Adams · Tyngsborough, US
Alok Ulhas Parlikar · Somerville, US
Jeffrey Paul Lilly · Seattle, US
Ariya Rastrow · Seattle, US

Key dates

Filing date	Mar 4, 2014
Grant date	Jul 2, 2019
Priority date	—
Expiry date	Mar 23, 2034

Classification

Technology area (CPC G)Physics
CPC primaryG10L2015/025
WIPO fieldComputer technology
WIPO sectorElectrical engineering

Abstract

An automatic speech recognition (ASR) device may be configured to predict pronunciations of textual identifiers (for example, song names, etc.) based on predicting one or more languages of origin of the textual identifier. The one or more languages of origin may be determined based on the textual identifier. The pronunciations may include a hybrid pronunciation including a pronunciation in one language, a pronunciation in a second language and a hybrid pronunciation that combines multiple languages. The pronunciations may be added to a lexicon and matched to the content item (e.g., song) and/or textual identifier. The ASR device may receive a spoken utterance from a user requesting the ASR device to access the content item. The ASR device determines whether the spoken utterance matches one of the pronunciations of the content item in the lexicon. The ASR device then accesses the content when the spoken utterance matches one of the potential textual identifier pronunciations.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.