Patent · US Active

Acoustic model training corpus selection

US9378731B2 · kind B2 · utility

6Cited by

0References

20Claims

0Family size

Assignee

Google LLC · US

Inventors

Olga Kapralova · Bern, CH
John Paul Alex · Brooklyn, US
Eugene Weinstein · New York, US
Pedro J. Moreno Mengibar · Jersey City, US
Olivier Siohan · New York, US
Ignacio Lopez Moreno · New York, US

Key dates

Filing date	Apr 22, 2015
Grant date	Jun 28, 2016
Priority date	—
Expiry date	Apr 22, 2035

Classification

Technology area (CPC G)Physics
CPC primaryG10L2015/0633
WIPO fieldComputer technology
WIPO sectorElectrical engineering

Abstract

The present disclosure relates to training a speech recognition system. One example method includes receiving a collection of speech data items, wherein each speech data item corresponds to an utterance that was previously submitted for transcription by a production speech recognizer. The production speech recognizer uses initial production speech recognizer components in generating transcriptions of speech data items. A transcription for each speech data item is generated using an offline speech recognizer, and the offline speech recognizer components are configured to improve speech recognition accuracy in comparison with the initial production speech recognizer components. The updated production speech recognizer components are trained for the production speech recognizer using a selected subset of the transcriptions of the speech data items generated by the offline speech recognizer. An updated production speech recognizer component is provided to the production speech recognizer for use in transcribing subsequently received speech data items.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.