Patent · US Active

Utterance selection for automated speech recognizer training

US9263033B2 · kind B2 · utility

Cited by	4

References	0

Claims	20

Family size	0

Assignee

Google LLC · US

Inventors

Olivier Siohan · New York, US
Pedro J. Moreno Mengibar · Jersey City, US

Key dates

Filing date	Jun 25, 2014
Grant date	Feb 16, 2016
Priority date	—
Expiry date	Sep 18, 2034

Classification

Technology area (CPC G)	Physics
CPC primary	G10L2015/0635
WIPO field	Computer technology
WIPO sector	Electrical engineering

Abstract

Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for generating a set of training utterances. The methods, systems, and apparatus include actions of obtaining a target multi-dimensional distribution of characteristics in an initial set of candidate utterances and selecting a subset of the initial set of candidate utterances based on speech recognition confidence scores associated with the candidate utterances. Additional actions include selecting a particular candidate utterance from the subset of the initial set of utterances and determining that adding the particular candidate utterance to a set of training utterances reduces a divergence of a multi-dimensional distribution of the characteristics in the set of training utterances from the target multi-dimensional distribution. Further actions include adding the particular candidate utterance to the set of training utterances.
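The selection loop the abstract describes can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the patented implementation. The abstract does not say which divergence measure is used or how the confidence scores define the subset, so the sketch assumes KL divergence and a simple minimum-confidence threshold. The record fields ("id", "confidence", "traits") and the names select_training_utterances and min_confidence are hypothetical.

```python
import math
from collections import Counter


def distribution(counts, keys, smoothing=1e-6):
    """Normalize counts over `keys` into probabilities (additive smoothing)."""
    total = sum(counts.get(k, 0) for k in keys) + smoothing * len(keys)
    return {k: (counts.get(k, 0) + smoothing) / total for k in keys}


def kl_divergence(p, q):
    """KL(p || q); assumed here as the divergence measure."""
    return sum(p[k] * math.log(p[k] / q[k]) for k in p if p[k] > 0)


def select_training_utterances(candidates, min_confidence=0.8):
    """Greedy sketch. Each candidate is a dict with hypothetical fields:
    "id", "confidence" (recognizer score) and "traits" (a tuple of
    characteristics, giving one cell of the multi-dimensional distribution).
    """
    # Target distribution: characteristics of the initial candidate set.
    target_counts = Counter(c["traits"] for c in candidates)
    keys = list(target_counts)
    target = distribution(target_counts, keys, smoothing=0.0)

    # Subset chosen by confidence score (assumed: a lower threshold).
    subset = [c for c in candidates if c["confidence"] >= min_confidence]

    selected = []
    selected_counts = Counter()
    current = math.inf  # the first candidate examined is always accepted
    for cand in subset:
        trial = selected_counts.copy()
        trial[cand["traits"]] += 1
        divergence = kl_divergence(target, distribution(trial, keys))
        # Keep the candidate only if it moves the training set's
        # distribution closer to the target.
        if divergence < current:
            selected.append(cand)
            selected_counts = trial
            current = divergence
    return selected
```

Calling select_training_utterances with a list of such records returns the accepted candidates in the order they were examined. Candidates below the threshold are never considered, and a candidate that would skew the training set further from the target is skipped.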

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.