Patent · US Expired

Speech processing technique for use in speech recognition and speech coding

US6263306A · kind A · utility

8Cited by
4References
7Claims
0Family size

Assignee

Inventors

Key dates

Filing dateFeb 26, 1999
Grant dateJul 17, 2001
Priority date
Expiry dateFeb 26, 2019

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG10L15/02
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

A technique for obtaining an intermediate set of frequency dependant features from a speech signal for use in speech processing and in obtaining estimates of speech pitch. The technique utilizes multiple tapers derived from Slepian sequences to obtain a product of the speech signal and the Slepian functions. Multiple tapered Fourier transforms are then obtained from the product, from which the set of frequency dependent features are calculated. In a preferred embodiment, a derivative of the cepstrum of the speech signal is used as an estimate of speech signal pitch. In another preferred embodiment, the F-spectrum is calculated from the product and the F-cepstrum is obtained therefrom by calculating the Fourier transform of the smoothed derivative of the log of the F-spectrum. The maximum of the F-cepstrum also provides a pitch estimation.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.