Patent · US Expired

Quantization using frequency and mean compensated frequency input data for robust speech recognition

US6418412B1 · kind B1 · utility

31Cited by

26References

28Claims

0Family size

Assignee

Legerity, Inc. · US

Inventors

Safdar M. Asghar · Austin, US
Lin Cong · Austin, US

Key dates

Filing date	Aug 28, 2000
Grant date	Jul 9, 2002
Priority date	—
Expiry date	Aug 28, 2020

Classification

Technology area (CPC G)Physics
CPC primaryG10L15/144
WIPO fieldComputer technology
WIPO sectorElectrical engineering

Abstract

A speech recognition system utilizes multiple quantizers to process frequency parameters and mean compensated frequency parameters derived from an input signal. The quantizers may be matrix and vector quantizer pairs, and such quantizer pairs may also function as front ends to a second stage speech classifiers such as hidden Markov models (HMMs) and/or utilizes neural network postprocessing to, for example, improve speech recognition performance. Mean compensating the frequency parameters can remove noise frequency components that remain approximately constant during the duration of the input signal. HMM initial state and state transition probabilities derived from common quantizer types and the same input signal may be consolidated to improve recognition system performance and efficiency. Matrix quantization exploits the “evolution” of the speech short-term spectral envelopes as well as frequency domain information, and vector quantization (VQ) primarily operates on frequency domain information. Time domain information may be substantially limited which may introduce error into the matrix quantization, and the VQ may provide error compensation. The matrix and vector qu…

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.