New-word pronunciation learning using a pronunciation graph
US7590533B2 · kind B2 · utility
Assignee
Inventor
Key dates
| Filing date | Mar 10, 2004 |
| Grant date | Sep 15, 2009 |
| Priority date | — |
| Expiry date | Dec 16, 2026 |
Classification
- Technology area (CPC G): Physics
- CPC primary: G10L2015/025
- WIPO field: Computer technology
- WIPO sector: Electrical engineering
Abstract
A method and computer-readable medium convert the text of a word and a user's pronunciation of the word into a phonetic description to be added to a speech recognition lexicon. Initially, a plurality of at least two possible phonetic descriptions are generated. One phonetic description is formed by decoding a speech signal representing a user's pronunciation of the word. At least one other phonetic description is generated from the text of the word. The plurality of possible sequences comprising speech-based and text-based phonetic descriptions are aligned and scored in a single graph based on their correspondence to the user's pronunciation. The phonetic description with the highest score is then selected for entry in the speech recognition lexicon.
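The selection step described in the abstract can be illustrated with a minimal sketch. It is not the patented implementation: the abstract does not specify the scoring model, so the sketch assumes a toy acoustic representation in which each frame of the user's speech is a dictionary of per-phone log-likelihoods. All names here (`Frame`, `FLOOR`, `align_score`, `select_pronunciation`) are hypothetical. For brevity, each candidate is scored as a separate path rather than being merged into one shared graph, which gives the same winner in this toy setting.

```python
from typing import Dict, List, Sequence

# Hypothetical representation: one frame of speech maps phone -> log-likelihood.
Frame = Dict[str, float]

NEG_INF = float("-inf")
FLOOR = -10.0  # assumed penalty for a phone the frame gives no evidence for


def align_score(phones: Sequence[str], frames: Sequence[Frame]) -> float:
    """Best log score of a monotonic alignment of phones to frames.

    Every phone must cover at least one frame, and phones are consumed in order.
    """
    if not phones or len(phones) > len(frames):
        return NEG_INF
    # best[j] = best score so far with the current frame assigned to phones[j]
    best = [NEG_INF] * len(phones)
    best[0] = frames[0].get(phones[0], FLOOR)
    for frame in frames[1:]:
        new = [NEG_INF] * len(phones)
        for j, phone in enumerate(phones):
            stay = best[j]                               # remain on the same phone
            advance = best[j - 1] if j > 0 else NEG_INF  # move on from the previous phone
            prev = max(stay, advance)
            if prev > NEG_INF:
                new[j] = prev + frame.get(phone, FLOOR)
        best = new
    return best[-1]  # the alignment must end on the last phone


def select_pronunciation(
    candidates: Sequence[List[str]], frames: Sequence[Frame]
) -> List[str]:
    """Return the candidate phonetic description with the highest score."""
    return max(candidates, key=lambda c: align_score(c, frames))


# Toy data: four frames of a user saying something like "cat".
frames = [
    {"k": -0.1, "s": -2.0},
    {"ae": -0.2, "eh": -1.5},
    {"ae": -0.3},
    {"t": -0.1},
]
candidates = [
    ["k", "eh", "t"],  # stands in for the speech-decoded description
    ["s", "ae", "t"],  # stands in for a text-based description
    ["k", "ae", "t"],  # stands in for another text-based description
]
best = select_pronunciation(candidates, frames)
print(" ".join(best))
```

In this sketch the text-based candidate `k ae t` is selected because it aligns best with the frame evidence. A fuller implementation would share common arcs between candidates in one graph and would use a trained acoustic model in place of the hand-written frame scores.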
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.