Patent · US Expired

Method for representing word models for use in speech recognition

US4903305A · kind A · utility

335Cited by

1References

34Claims

0Family size

Assignee

Dragon Systems, Inc. · US

Inventors

Laurence S. Gillick · Newton, US
Dean Sturtevant · Watertown, US
Robert Roth · Watertown, US
James K. Baker · Eatonville, US
Janet Baker · Newton, US

Key dates

Filing date	Mar 23, 1989
Grant date	Feb 20, 1990
Priority date	—
Expiry date	Mar 23, 2009

Classification

Technology area (CPC G)Physics
CPC primaryG10L2015/0631
WIPO fieldComputer technology
WIPO sectorElectrical engineering

Abstract

A method is provided for deriving acoustic word representations for use in speech recognition. Initial word models are created, each formed of a sequence of acoustic sub-models. The acoustic sub-models from a plurality of word models are clustered, so as to group acoustically similar sub-models from different words, using, for example, the Kullback-Leibler information as a metric of similarity. Then each word is represented by cluster spelling representing the clusters into which its acoustic sub-models were placed by the clustering. Speech recognition is performed by comparing sequences of frames from speech to be recognized against sequences of acoustic models associated with the clusters of the cluster spelling of individual word models. The invention also provides a method for deriving a word representation which involves receiving a first set of frame sequences for a word, using dynamic programming to derive a corresponding initial sequence of probabilistic acoustic sub-models for the word independently of any previously derived acoustic model particular to the word, using dynamic programming to time align each of a second set of frame sequences for the word into a succession …

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.