Patent · US Expired

Methods and apparatus for automatic generation of multiple pronunciations from acoustic data

US7181395B1 · kind B1 · utility

25Cited by
10References
21Claims
0Family size

Assignee

Inventors

Key dates

Filing dateOct 27, 2000
Grant dateFeb 20, 2007
Priority date
Expiry dateAug 25, 2023

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG10L15/065
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

Methods and apparatus for automatically deriving multiple phonetic baseforms of a word from a speech utterance of this word are provided in accordance with the present invention. In one embodiment, a method of automatically generating two or more phonetic baseforms from a spoken utterance representing a word includes the steps of: transforming the spoken utterance into a stream of acoustic observations; generating two or more strings of subphone units, wherein each string of subphone units represents a string of subphone units substantially maximizing a log-likelihood of the stream of acoustic observations, and wherein the log-likelihood is computed as a weighted sum of a transition score associated with a transition model and of an acoustic score associated with an acoustic model; and converting the two or more strings of subphone units into two or more phonetic baseforms.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.