Method of assessing degree of acoustic confusability, and system therefor
US7013276B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Oct 5, 2001 |
| Grant date | Mar 14, 2006 |
| Priority date | — |
| Expiry date | Oct 12, 2023 |
Classification
- Technology area (CPC G): Physics
- CPC primary: G10L15/10
- WIPO field: Computer technology
- WIPO sector: Electrical engineering
Abstract
A method of predicting speech recognizer confusion between utterances, where the utterances can be represented by any combination of text form and audio file. The utterances are converted to an intermediate representation that directly reflects their acoustic characteristics. Text representations can be used directly to predict confusability, without access to audio file examples of the utterances. In a first embodiment, two text utterances are each represented as a string of phonemes, and the least cost of transforming one phoneme string into the other serves as the confusability measure. In a second embodiment, two utterances are represented as sequences of acoustic events, based on phonetic capabilities of speakers and obtained from the acoustic signals of the utterances, and the acoustic events are compared. Confusability is then predicted according to the formula 2K/T, where K is the number of matched acoustic events and T is the total number of acoustic events.
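The two measures described in the abstract can be illustrated with a minimal sketch. This is not the patented implementation; several details are assumptions made for illustration. The least-cost transformation is modelled as a plain Levenshtein-style edit distance with unit costs (the abstract does not state the cost values). Matched acoustic events K are counted as a longest common subsequence, and T is taken as the combined number of events in both sequences, so that 2K/T falls between 0 and 1. The function names and the phoneme symbols are hypothetical.

```python
from typing import Sequence


def phoneme_edit_cost(a: Sequence[str], b: Sequence[str],
                      sub_cost: float = 1.0, indel_cost: float = 1.0) -> float:
    """Least cost of transforming phoneme string a into phoneme string b.

    Assumption: unit insertion, deletion, and substitution costs.
    A lower cost suggests the two utterances are more confusable.
    """
    # prev[j] holds the cost of turning the processed prefix of a into b[:j].
    prev = [j * indel_cost for j in range(len(b) + 1)]
    for i, pa in enumerate(a, start=1):
        curr = [i * indel_cost]
        for j, pb in enumerate(b, start=1):
            curr.append(min(
                prev[j] + indel_cost,                           # delete pa
                curr[j - 1] + indel_cost,                       # insert pb
                prev[j - 1] + (0.0 if pa == pb else sub_cost),  # match or substitute
            ))
        prev = curr
    return prev[-1]


def matched_events(a: Sequence[str], b: Sequence[str]) -> int:
    """Count matched events K.

    Assumption: a match is an in-order correspondence, so K is the length
    of the longest common subsequence of the two event sequences.
    """
    prev = [0] * (len(b) + 1)
    for ea in a:
        curr = [0]
        for j, eb in enumerate(b, start=1):
            if ea == eb:
                curr.append(prev[j - 1] + 1)
            else:
                curr.append(max(prev[j], curr[j - 1]))
        prev = curr
    return prev[-1]


def confusability(a: Sequence[str], b: Sequence[str]) -> float:
    """Score 2K/T, with T assumed to be len(a) + len(b)."""
    t = len(a) + len(b)
    if t == 0:
        return 0.0  # assumption: two empty sequences score zero
    return 2 * matched_events(a, b) / t
```

Under these assumptions, two utterances with identical phoneme strings (for example, homophones) have an edit cost of zero and a 2K/T score of 1, while utterances sharing no events score 0. A real system would likely weight substitutions by acoustic similarity between phonemes rather than using unit costs.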
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.