Patent · US · Expired

Method for representing word models for use in speech recognition

US4903305A · kind A · utility

Cited by: 335
References: 1
Claims: 34
Family size: 0

Assignee

Inventors

Key dates

Filing date: Mar 23, 1989
Grant date: Feb 20, 1990
Priority date
Expiry date: Mar 23, 2009

Classification

  • Technology area (CPC G): Physics
  • CPC primary: G10L2015/0631
  • WIPO field: Computer technology
  • WIPO sector: Electrical engineering

Abstract

A method is provided for deriving acoustic word representations for use in speech recognition. Initial word models are created, each formed of a sequence of acoustic sub-models. The acoustic sub-models from a plurality of word models are clustered, so as to group acoustically similar sub-models from different words, using, for example, the Kullback-Leibler information as a metric of similarity. Then each word is represented by cluster spelling representing the clusters into which its acoustic sub-models were placed by the clustering. Speech recognition is performed by comparing sequences of frames from speech to be recognized against sequences of acoustic models associated with the clusters of the cluster spelling of individual word models. The invention also provides a method for deriving a word representation which involves receiving a first set of frame sequences for a word, using dynamic programming to derive a corresponding initial sequence of probabilistic acoustic sub-models for the word independently of any previously derived acoustic model particular to the word, using dynamic programming to time align each of a second set of frame sequences for the word into a succession …
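The clustering and cluster-spelling steps described in the abstract can be illustrated with a minimal Python sketch. It is not the patented procedure. It rests on assumptions the abstract does not state: each acoustic sub-model is a diagonal Gaussian given as a list of (mean, variance) pairs, similarity is the symmetrized Kullback-Leibler divergence, and clustering is a simple greedy pass with a distance threshold. The names `kl_gauss`, `sym_kl`, `cluster_spellings`, and `threshold` are hypothetical.

```python
import math

def kl_gauss(p, q):
    """KL(p || q) for diagonal Gaussians, each a list of (mean, variance) pairs.
    Assumption: Gaussian sub-models; the abstract does not specify the form."""
    total = 0.0
    for (mp, vp), (mq, vq) in zip(p, q):
        total += 0.5 * (math.log(vq / vp) + (vp + (mp - mq) ** 2) / vq - 1.0)
    return total

def sym_kl(p, q):
    """Symmetrized KL divergence, used here as the similarity metric."""
    return kl_gauss(p, q) + kl_gauss(q, p)

def cluster_spellings(word_models, threshold):
    """Greedy clustering (an illustrative choice, not taken from the patent).

    Each sub-model joins the nearest existing cluster whose representative is
    closer than `threshold`; otherwise it starts a new cluster. Returns the
    cluster representatives and, per word, its cluster spelling: the sequence
    of cluster indices its sub-models were placed in.
    """
    centers = []
    spellings = {}
    for word, sub_models in word_models.items():
        spelling = []
        for sm in sub_models:
            best, best_d = None, threshold
            for idx, center in enumerate(centers):
                d = sym_kl(sm, center)
                if d < best_d:
                    best, best_d = idx, d
            if best is None:
                centers.append(sm)
                best = len(centers) - 1
            spelling.append(best)
        spellings[word] = spelling
    return centers, spellings

# Toy data: two words, two one-dimensional sub-models each (made-up values).
word_models = {
    "cat": [[(0.0, 1.0)], [(5.0, 1.0)]],
    "cap": [[(0.1, 1.0)], [(9.0, 1.0)]],
}
centers, spellings = cluster_spellings(word_models, threshold=1.0)
print(spellings)  # {'cat': [0, 1], 'cap': [0, 2]}
```

In this toy run the first sub-models of both words fall into the same cluster, so the two spellings share cluster 0. Recognition as described in the abstract would then score incoming frames against the acoustic models attached to each cluster in a word's spelling.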

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.