Patent · US Expired

Methods and apparatus for automatic generation of multiple pronunciations from acoustic data

US7181395B1 · kind B1 · utility

25Cited by

10References

21Claims

0Family size

Assignee

International Business Machines Corporation · US

Inventors

Sabine Deligne · New York, US
Ramesh Gopinath · Millwood, US
Benoit Maison · White Plains, US

Key dates

Filing date	Oct 27, 2000
Grant date	Feb 20, 2007
Priority date	—
Expiry date	Aug 25, 2023

Classification

Technology area (CPC G)Physics
CPC primaryG10L15/065
WIPO fieldComputer technology
WIPO sectorElectrical engineering

Abstract

Methods and apparatus for automatically deriving multiple phonetic baseforms of a word from a speech utterance of this word are provided in accordance with the present invention. In one embodiment, a method of automatically generating two or more phonetic baseforms from a spoken utterance representing a word includes the steps of: transforming the spoken utterance into a stream of acoustic observations; generating two or more strings of subphone units, wherein each string of subphone units represents a string of subphone units substantially maximizing a log-likelihood of the stream of acoustic observations, and wherein the log-likelihood is computed as a weighted sum of a transition score associated with a transition model and of an acoustic score associated with an acoustic model; and converting the two or more strings of subphone units into two or more phonetic baseforms.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.