Automatic speech recognition with psychoacoustically-based feature extraction, using easily-tunable single-shape filters along logarithmic-frequency axis
US6701291B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Event | Date |
| --- | --- |
| Filing date | Apr 2, 2001 |
| Grant date | Mar 2, 2004 |
| Priority date | — |
| Expiry date | Mar 31, 2022 |
Classification
- Technology area (CPC G): Physics
- CPC primary: G10L19/0212
- WIPO field: Computer technology
- WIPO sector: Electrical engineering
Abstract
A method and apparatus for extracting speech features from a speech signal in which the linear frequency spectrum data, as generated, for example, by a conventional frequency transform, is first converted to logarithmic frequency spectrum data having frequency data distributed on a substantially logarithmic (rather than linear) frequency scale. Then, a plurality of digital auditory filters is applied to the resultant logarithmic frequency spectrum data, each of these filters having a substantially similar shape but centered at a different point on the logarithmic frequency scale. Because each of the filters has a similar shape, the feature extraction approach of the present invention advantageously can be easily modified or tuned by adjusting all of the filters in a coordinated manner, with the adjustment of only a handful of filter parameters.
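
The pipeline described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the function names, the raised-cosine filter shape, the bin counts, and the `f_min`, `half_width`, and `sharpness` parameters are all assumptions chosen for illustration. It shows only the general idea of resampling a linear-frequency spectrum onto a logarithmic axis and then applying one shared filter shape at several centers.

```python
import numpy as np

def to_log_frequency(power_spectrum, sample_rate, n_log_bins=128, f_min=100.0):
    """Resample a linear-frequency power spectrum onto a log-spaced grid.

    n_log_bins and f_min are illustrative defaults, not values from the patent.
    """
    linear_freqs = np.linspace(0.0, sample_rate / 2.0, len(power_spectrum))
    log_freqs = np.geomspace(f_min, sample_rate / 2.0, n_log_bins)
    return log_freqs, np.interp(log_freqs, linear_freqs, power_spectrum)

def filter_shape(half_width, sharpness=2.0):
    """One symmetric shape shared by every filter (assumed: raised cosine).

    The whole bank is tuned through just these two parameters.
    """
    x = np.linspace(-1.0, 1.0, 2 * half_width + 1)
    shape = np.clip(np.cos(0.5 * np.pi * x), 0.0, None) ** sharpness
    return shape / shape.sum()  # unit gain for a flat spectrum

def extract_features(power_spectrum, sample_rate, n_filters=20,
                     half_width=6, sharpness=2.0):
    """Apply identically shaped filters at evenly spaced log-frequency centers."""
    _, log_spec = to_log_frequency(power_spectrum, sample_rate)
    shape = filter_shape(half_width, sharpness)
    # Identical shapes on the log axis reduce the filter bank to a single
    # convolution that is then sampled at the filter centers.
    smoothed = np.convolve(log_spec, shape, mode="same")
    centers = np.linspace(half_width, len(log_spec) - 1 - half_width,
                          n_filters).round().astype(int)
    return np.log(smoothed[centers] + 1e-10)  # log-compressed filter outputs
```

In this sketch, changing `half_width` or `sharpness` retunes every filter at once, which mirrors the coordinated adjustment the abstract describes. A real front end would compute `power_spectrum` from a windowed frame of speech, for example with `np.abs(np.fft.rfft(frame)) ** 2`.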
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.