Patent · US Expired

Method and system for low bit rate speech coding with speech recognition features and pitch providing reconstruction of the spectral envelope

US6678655B2 · kind B2 · utility

28Cited by

11References

26Claims

0Family size

Assignee

International Business Machines Corporation · US

Inventors

Ron Hoory · Haifa, IL
Dan Chazan · Haifa, IL
Ezra Silvera · Haifa, IL
Meir Zibulski · Haifa, IL

Key dates

Filing date	Nov 12, 2002
Grant date	Jan 13, 2004
Priority date	—
Expiry date	Nov 12, 2022

Classification

Technology area (CPC G)Physics
CPC primaryG10L15/02
WIPO fieldComputer technology
WIPO sectorElectrical engineering

Abstract

A method for encoding a digitized speech signal so as to generate data capable of being decoded as speech. A digitized speech signal is first converted to a series of feature vectors using for example known Mel-frequency Cepstral coefficients (MFCC) techniques. At successive instances instance of time a respective pitch value of the digitized speech signal is computed, and successive acoustic vectors each containing the respective pitch value and feature vector are compressed so as to derive therefrom a bit stream. A suitable decoder reverses the operation so as to extract the features vectors and pitch values, thus allowing speech reproduction and playback. In addition, speech recognition is possible using the decompressed feature vectors, with no impairment of the recognition accuracy and no computational overhead.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.