Methods and apparatus for automatic generation of multiple pronunciations from acoustic data
US7181395B1 · kind B1 · utility
Assignee
Inventors
Key dates
| Filing date | Oct 27, 2000 |
| Grant date | Feb 20, 2007 |
| Priority date | — |
| Expiry date | Aug 25, 2023 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG10L15/065
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
Methods and apparatus for automatically deriving multiple phonetic baseforms of a word from a speech utterance of this word are provided in accordance with the present invention. In one embodiment, a method of automatically generating two or more phonetic baseforms from a spoken utterance representing a word includes the steps of: transforming the spoken utterance into a stream of acoustic observations; generating two or more strings of subphone units, wherein each string of subphone units represents a string of subphone units substantially maximizing a log-likelihood of the stream of acoustic observations, and wherein the log-likelihood is computed as a weighted sum of a transition score associated with a transition model and of an acoustic score associated with an acoustic model; and converting the two or more strings of subphone units into two or more phonetic baseforms.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.