Spatio-temporal speech enhancement technique based on generalized eigenvalue decomposition
US8374854B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Mar 27, 2009 |
| Grant date | Feb 12, 2013 |
| Priority date | — |
| Expiry date | Mar 31, 2031 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG10L2021/02168
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
The present invention describes a speech enhancement method using microphone arrays and a new iterative technique for enhancing noisy speech signals under low signal-to-noise-ratio (SNR) environments. A first embodiment involves the processing of the observed noisy speech both in the spatial- and the temporal-domains to enhance the desired signal component speech and an iterative technique to compute the generalized eigenvectors of the multichannel data derived from the microphone array. The entire processing is done on the spatio-temporal correlation coefficient sequence of the observed data in order to avoid large matrix-vector multiplications. A further embodiment relates to a speech enhancement system that is composed of two stages. In the first stage, the noise component of the observed signal is whitened, and in the second stage a spatio-temporal power method is used to extract the most dominant speech component. In both the stages, the filters are adapted using the multichannel spatio-temporal correlation coefficients of the data and hence avoid large matrix vector multiplications.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.