Robust speech processing with affine transform replicated data
US6038528A · kind A · utility
Assignee
Inventors
Key dates
| Filing date | Jul 17, 1996 |
| Grant date | Mar 14, 2000 |
| Priority date | — |
| Expiry date | Jul 17, 2016 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG10L21/0216
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
The present invention relates to a robust speech processing method and system which models channel and noise variations with affine transforms to reduce mismatched conditions between training and testing. The affine transform relating the training vectors C.sub.k with the vectors for testing condition c.sub.k', is represented by the form: EQU c'.sub.k.sup.T =Ac.sub.k.sup.T +b for k=1 to N in which A is a matrix of predicator coefficients representing noise distortions and vector b represents channel distortions. Alternatively, an affine invariant cepstrum is generated during testing and training for modeling speech to account for noise and channel effects. From the improved speech processing, improved speaker recognition with channel and noise variations is obtained.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.