Patent · US Active

System and methods for acoustic and language modeling for automatic speech recognition with large vocabularies

US7801727B2 · kind B2 · utility

5Cited by

11References

20Claims

0Family size

Assignee

Nuance Communications, Inc. · US

Inventors

Ponani Gopalakrishnan · New Delhi, IN
Dimitri Kanevsky · Ossining, US
Michael D. Monkowski · New Windsor, US
Jan Sedivy · Praha, CZ

Key dates

Filing date	Feb 24, 2005
Grant date	Sep 21, 2010
Priority date	—
Expiry date	Dec 1, 2027

Classification

Technology area (CPC Y)Emerging Cross-Sectional Technologies
CPC primaryY10S707/99942
WIPO fieldComputer technology
WIPO sectorElectrical engineering

Abstract

A method for generating a language component vocabulary VC for a speech recognition system having a language vocabulary V of a plurality of word forms is disclosed. The method includes: partitioning the language vocabulary V into subsets of word forms based on frequencies of occurrence of the respective word forms; and in at least one of the subsets, splitting word forms having frequencies less than a threshold to thereby generate word form components. Also disclosed is a method for use in speech recognition including: splitting an acoustic vocabulary comprising baseforms into baseform components and storing the baseform components; and, performing sound to spelling mapping on the baseform components so as to generate a baseform components to word parts table for use in subsequent decoding of speech. A method for decoding a speech utterance using language model components and acoustic components, includes the steps of: generating from the utterance a stack of baseform component paths; concatenating baseform components in a path to generate concatenated baseforms, when the concatenated baseform components correspond to a baseform found in an acoustic vocabulary; mapping the concaten…

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.