Patent · US Active

Multi-dialect and multilingual speech recognition

US11900915B2 · kind B2 · utility

Cited by	4

References	2

Claims	20

Family size	0

Assignee

Google LLC · US

Inventors

Zhifeng Chen · Sunnyvale, US
Bo Li · Dongfeng Town, CN
Eugene Weinstein · New York, US
Yonghui Wu · Fremont, US
Pedro J. Moreno Mengibar · Jersey City, US
Ron J. Weiss · New York, US
Khe Chai Sim · Dublin, US
Tara N. Sainath · Jersey City, US
Patrick Nguyen · Kirkland, US

Key dates

Filing date	Jan 10, 2022
Grant date	Feb 13, 2024
Priority date	—
Expiry date	Jan 10, 2042

Classification

Technology area (CPC G)	Physics
CPC primary	G10L2015/0631
WIPO field	Computer technology
WIPO sector	Electrical engineering

Abstract

Methods, systems, and apparatus, including computer programs encoded on computer-readable media, for speech recognition using multi-dialect and multilingual models. In some implementations, audio data indicating audio characteristics of an utterance is received. Input features determined based on the audio data are provided to a speech recognition model that has been trained to output scores indicating the likelihood of linguistic units for each of multiple different languages or dialects. The speech recognition model can be one that has been trained using cluster adaptive training. Output that the speech recognition model generated in response to receiving the input features determined based on the audio data is received. A transcription of the utterance generated based on the output of the speech recognition model is provided.
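
The flow described in the abstract can be illustrated with a toy sketch. It is not the patented implementation: it assumes a common formulation of cluster adaptive training in which a layer's weights are a dialect-specific interpolation of shared cluster weight matrices, and every name and value below (UNITS, BASES, DIALECT_MIX, score_units, the dialect tags) is hypothetical.

```python
import math

def interpolate_weights(bases, mix):
    """Combine cluster weight matrices: W = sum over c of mix[c] * bases[c]."""
    rows, cols = len(bases[0]), len(bases[0][0])
    return [
        [sum(m * b[r][c] for m, b in zip(mix, bases)) for c in range(cols)]
        for r in range(rows)
    ]

def softmax(xs):
    """Turn raw scores into probabilities that sum to 1."""
    peak = max(xs)
    exps = [math.exp(x - peak) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]

def score_units(features, bases, mix):
    """Score each linguistic unit for one frame of input features."""
    weights = interpolate_weights(bases, mix)
    logits = [sum(w * f for w, f in zip(row, features)) for row in weights]
    return softmax(logits)

# Hypothetical toy values: three linguistic units, two clusters, 2-d features.
UNITS = ["a", "b", "c"]
BASES = [
    [[1.0, 0.0], [0.0, 1.0], [0.0, 0.0]],  # cluster 0
    [[0.0, 1.0], [1.0, 0.0], [0.0, 0.0]],  # cluster 1
]
# Assumed per-dialect interpolation vectors (one-hot here for clarity).
DIALECT_MIX = {"en-US": [1.0, 0.0], "en-GB": [0.0, 1.0]}

features = [2.0, 0.5]  # stand-in for features derived from audio data
scores_us = score_units(features, BASES, DIALECT_MIX["en-US"])
scores_gb = score_units(features, BASES, DIALECT_MIX["en-GB"])
best_us = UNITS[scores_us.index(max(scores_us))]
best_gb = UNITS[scores_gb.index(max(scores_gb))]
```

With these toy values the same input features yield a different top-scoring unit under each interpolation vector, which is the effect a dialect-adapted model relies on. A real system would apply this per frame inside a trained neural network and decode the scores into a transcription.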

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.