Patent · US Expired

Speech processing technique for use in speech recognition and speech coding

US6263306A · kind A · utility

8Cited by

4References

7Claims

0Family size

Assignee

LUCENT TECHNOLOGIES INC. · US

Inventors

Michael Sean Fee · New Vernon, US
Ching E. Ho · San Jose, US
Partha P. Mitra · New York, US
Bijan Pesaran · Pasadena, US

Key dates

Filing date	Feb 26, 1999
Grant date	Jul 17, 2001
Priority date	—
Expiry date	Feb 26, 2019

Classification

Technology area (CPC G)Physics
CPC primaryG10L15/02
WIPO fieldComputer technology
WIPO sectorElectrical engineering

Abstract

A technique for obtaining an intermediate set of frequency dependant features from a speech signal for use in speech processing and in obtaining estimates of speech pitch. The technique utilizes multiple tapers derived from Slepian sequences to obtain a product of the speech signal and the Slepian functions. Multiple tapered Fourier transforms are then obtained from the product, from which the set of frequency dependent features are calculated. In a preferred embodiment, a derivative of the cepstrum of the speech signal is used as an estimate of speech signal pitch. In another preferred embodiment, the F-spectrum is calculated from the product and the F-cepstrum is obtained therefrom by calculating the Fourier transform of the smoothed derivative of the log of the F-spectrum. The maximum of the F-cepstrum also provides a pitch estimation.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.