Patent · US Expired

Automatic speech recognition with psychoacoustically-based feature extraction, using easily-tunable single-shape filters along logarithmic-frequency axis

US6701291B2 · kind B2 · utility

Cited by: 4
References: 5
Claims: 60
Family size: 0

Assignee

Inventors

Key dates

Filing date: Apr 2, 2001
Grant date: Mar 2, 2004
Priority date
Expiry date: Mar 31, 2022

Classification

  • Technology area (CPC G): Physics
  • CPC primary: G10L19/0212
  • WIPO field: Computer technology
  • WIPO sector: Electrical engineering

Abstract

A method and apparatus for extracting speech features from a speech signal, in which linear frequency spectrum data (as generated, for example, by a conventional frequency transform) is first converted to logarithmic frequency spectrum data, that is, frequency data distributed on a substantially logarithmic rather than linear frequency scale. A plurality of digital auditory filters is then applied to the resulting logarithmic frequency spectrum data, each filter having a substantially similar shape but centered at a different point on the logarithmic frequency scale. Because each of the filters has a similar shape, the feature extraction approach of the present invention can advantageously be modified or tuned by adjusting all of the filters in a coordinated manner, changing only a handful of filter parameters.
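The two steps in the abstract can be illustrated with a minimal sketch. It is not the patented implementation: the Gaussian filter shape, the interpolation-based resampling, the function names, and all parameter values (`n_log_bins`, `f_min`, `n_filters`, `width`) are assumptions chosen for illustration, since the abstract does not specify them.

```python
import numpy as np

def log_frequency_spectrum(power, sample_rate, n_log_bins=128, f_min=100.0):
    """Resample a linear-frequency power spectrum onto a log-spaced grid.

    Linear interpolation is an assumed resampling scheme; the abstract only
    says the data ends up on a substantially logarithmic frequency scale.
    """
    lin_freqs = np.linspace(0.0, sample_rate / 2.0, len(power))
    log_freqs = np.geomspace(f_min, sample_rate / 2.0, n_log_bins)
    return log_freqs, np.interp(log_freqs, lin_freqs, power)

def single_shape_filterbank(log_power, n_filters=20, width=6.0):
    """Apply one filter shape at evenly spaced centers along the log axis.

    The Gaussian shape is hypothetical. Every filter depends only on the
    offset (position - center), so the single `width` parameter retunes
    all filters together.
    """
    positions = np.arange(len(log_power))
    centers = np.linspace(0, len(log_power) - 1, n_filters)
    offsets = positions[None, :] - centers[:, None]
    weights = np.exp(-0.5 * (offsets / width) ** 2)
    return weights @ log_power

# Example: one windowed frame of a 1 kHz tone sampled at 16 kHz.
sample_rate, n_fft = 16000, 512
t = np.arange(n_fft) / sample_rate
frame = np.sin(2 * np.pi * 1000.0 * t) * np.hanning(n_fft)
power = np.abs(np.fft.rfft(frame)) ** 2           # conventional frequency transform
log_freqs, log_power = log_frequency_spectrum(power, sample_rate)
features = single_shape_filterbank(log_power)      # one value per auditory filter
wider = single_shape_filterbank(log_power, width=12.0)  # retune every filter at once
```

Because the filters share one shape, widening or narrowing the whole bank is a change to a single argument rather than a per-filter redesign, which is the tuning property the abstract emphasizes.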

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.