Patent · US Expired

Quantization using frequency and mean compensated frequency input data for robust speech recognition

US6418412B1 · kind B1 · utility

31Cited by
26References
28Claims
0Family size

Assignee

Inventors

Key dates

Filing dateAug 28, 2000
Grant dateJul 9, 2002
Priority date
Expiry dateAug 28, 2020

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG10L15/144
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

A speech recognition system utilizes multiple quantizers to process frequency parameters and mean compensated frequency parameters derived from an input signal. The quantizers may be matrix and vector quantizer pairs, and such quantizer pairs may also function as front ends to a second stage speech classifiers such as hidden Markov models (HMMs) and/or utilizes neural network postprocessing to, for example, improve speech recognition performance. Mean compensating the frequency parameters can remove noise frequency components that remain approximately constant during the duration of the input signal. HMM initial state and state transition probabilities derived from common quantizer types and the same input signal may be consolidated to improve recognition system performance and efficiency. Matrix quantization exploits the “evolution” of the speech short-term spectral envelopes as well as frequency domain information, and vector quantization (VQ) primarily operates on frequency domain information. Time domain information may be substantially limited which may introduce error into the matrix quantization, and the VQ may provide error compensation. The matrix and vector qu…

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.