Speech representation by feature-based word prototypes comprising phoneme targets having reliable high similarity
US5684925A · kind A · utility
Assignee
Inventors
Key dates
| Filing date | Sep 8, 1995 |
| Grant date | Nov 4, 1997 |
| Priority date | — |
| Expiry date | Sep 8, 2015 |
Classification
- Technology area (CPC G): Physics
- CPC primary: G10L15/063
- WIPO field: Computer technology
- WIPO sector: Electrical engineering
Abstract
Digitized speech utterances are converted into phoneme similarity data, and regions of high similarity are extracted and used to form the word prototype. Unreliable high phoneme similarity regions are eliminated by alignment across speakers. Word prototype targets are then constructed from the following parameters: the phoneme symbol, the average peak height of the phoneme similarity score, the average peak location, and the left and right frame locations. Each target is assigned a statistical weight representing the percentage of speakers in which that high similarity region occurred. Because the word prototype is feature-based, a robust speech representation can be constructed without frame-by-frame analysis.
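The target construction described in the abstract can be illustrated with a minimal Python sketch. This is an illustration under stated assumptions, not the patented procedure: the names `Region`, `Target`, `extract_regions`, and `build_prototype` are hypothetical, as are the `threshold`, `tolerance`, and `min_weight` parameters. In particular, the cross-speaker alignment is replaced here by a simple rule that groups regions of the same phoneme whose peak frames lie within `tolerance` frames of each other, and the weight is expressed as a fraction of speakers rather than a percentage.

```python
from dataclasses import dataclass
from statistics import mean


@dataclass
class Region:
    """One high similarity region found in a single speaker's utterance."""
    phoneme: str
    peak_height: float
    peak_frame: int
    left_frame: int
    right_frame: int


@dataclass
class Target:
    """One word prototype target, averaged over the speakers that share it."""
    phoneme: str
    avg_peak_height: float
    avg_peak_frame: float
    avg_left_frame: float
    avg_right_frame: float
    weight: float  # fraction of speakers in which the region occurred


def extract_regions(phoneme, scores, threshold):
    """Return maximal runs of frames whose similarity score is >= threshold."""
    regions = []
    start = None
    # A trailing sentinel closes a run that reaches the last frame.
    for i, score in enumerate(list(scores) + [float("-inf")]):
        if score >= threshold and start is None:
            start = i
        elif score < threshold and start is not None:
            run = scores[start:i]
            peak = start + max(range(len(run)), key=run.__getitem__)
            regions.append(Region(phoneme, scores[peak], peak, start, i - 1))
            start = None
    return regions


def build_prototype(speakers, threshold=50.0, tolerance=3, min_weight=0.5):
    """Build targets from a list of {phoneme: per-frame scores} dicts.

    Assumption: grouping by peak-frame proximity stands in for the
    alignment step; groups seen in too few speakers are discarded.
    """
    groups = []  # each group is a list of (speaker_index, Region)
    for idx, utterance in enumerate(speakers):
        for phoneme, scores in utterance.items():
            for region in extract_regions(phoneme, scores, threshold):
                for group in groups:
                    ref = group[0][1]
                    seen = any(sp == idx for sp, _ in group)
                    close = abs(ref.peak_frame - region.peak_frame) <= tolerance
                    if ref.phoneme == phoneme and not seen and close:
                        group.append((idx, region))
                        break
                else:
                    groups.append([(idx, region)])

    targets = []
    for group in groups:
        weight = len(group) / len(speakers)
        if weight < min_weight:
            continue  # treated as an unreliable region
        regs = [r for _, r in group]
        targets.append(Target(
            regs[0].phoneme,
            mean(r.peak_height for r in regs),
            mean(r.peak_frame for r in regs),
            mean(r.left_frame for r in regs),
            mean(r.right_frame for r in regs),
            weight,
        ))
    return targets
```

Each resulting `Target` carries only a handful of numbers per region, which is the sense in which such a representation avoids storing or comparing every frame of every utterance.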
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.