Patent · US Expired

Quantization using frequency and mean compensated frequency input data for robust speech recognition

US6219642A · kind A · utility

28Cited by
9References
7Claims
0Family size

Assignee

Inventors

Key dates

Filing dateOct 5, 1998
Grant dateApr 17, 2001
Priority date
Expiry dateOct 5, 2018

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG10L15/144
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

A speech recognition system utilizes multiple quantizers to process frequency parameters and mean compensated frequency parameters derived from an input signal. The quantizers may be matrix and vector quantizer pairs, and such quantizer pairs may also function as front ends to a second stage speech classifiers such as hidden Markov models (HMMs) and/or utilizes neural network postprocessing to, for example, improve speech recognition performance. Mean compensating the frequency parameters can remove noise frequency components that remain approximately constant during the duration of the input signal. HMM initial state and state transition probabilities derived from common quantizer types and the same input signal may be consolidated to improve recognition system performance and efficiency. Matrix quantization exploits the "evolution" of the speech short-term spectral envelopes as well as frequency domain information, and vector quantization (VQ) primarily operates on frequency domain information. Time domain information may be substantially limited which may introduce error into the matrix quantization, and the VQ may provide error compensation. The matrix and vector quantizers may…

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.