Quantization using frequency and mean compensated frequency input data for robust speech recognition
US6418412B1 · kind B1 · utility
Assignee
Inventors
Key dates
| Filing date | Aug 28, 2000 |
| Grant date | Jul 9, 2002 |
| Priority date | — |
| Expiry date | Aug 28, 2020 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG10L15/144
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
A speech recognition system utilizes multiple quantizers to process frequency parameters and mean compensated frequency parameters derived from an input signal. The quantizers may be matrix and vector quantizer pairs, and such quantizer pairs may also function as front ends to a second stage speech classifiers such as hidden Markov models (HMMs) and/or utilizes neural network postprocessing to, for example, improve speech recognition performance. Mean compensating the frequency parameters can remove noise frequency components that remain approximately constant during the duration of the input signal. HMM initial state and state transition probabilities derived from common quantizer types and the same input signal may be consolidated to improve recognition system performance and efficiency. Matrix quantization exploits the “evolution” of the speech short-term spectral envelopes as well as frequency domain information, and vector quantization (VQ) primarily operates on frequency domain information. Time domain information may be substantially limited which may introduce error into the matrix quantization, and the VQ may provide error compensation. The matrix and vector qu…
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.