Patent · US Expired

Automatic speech recognition with psychoacoustically-based feature extraction, using easily-tunable single-shape filters along logarithmic-frequency axis

US6701291B2 · kind B2 · utility

4Cited by

5References

60Claims

0Family size

Assignee

LUCENT TECHNOLOGIES INC. · US

Inventors

Qi Li · New Providence, US
Olivier Siohan · New York, US
Frank Kao-Ping Soong · Beijing, CN

Key dates

Filing date	Apr 2, 2001
Grant date	Mar 2, 2004
Priority date	—
Expiry date	Mar 31, 2022

Classification

Technology area (CPC G)Physics
CPC primaryG10L19/0212
WIPO fieldComputer technology
WIPO sectorElectrical engineering

Abstract

A method and apparatus for extracting speech features from a speech signal in which the linear frequency spectrum data, as generated, for example, by a conventional frequency transform, is first converted to logarithmic frequency spectrum data having frequency data distributed on a substantially logarithmic (rather than linear) frequency scale. Then, a plurality of digital auditory filters is applied to the resultant logarithmic frequency spectrum data, each of these filters having a substantially similar shape, but centered at different points on the logarithmic frequency scale. Because each of the filters have a similar shape, the feature extraction approach of the present invention advantageously can be easily modified or tuned by adjusting each of the filters in a coordinated manner, with the adjustment of only a handful of filter parameters.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.