Patent · US Expired

Robust speech processing with affine transform replicated data

US6038528A · kind A · utility

43Cited by

12References

7Claims

0Family size

Assignee

T-Netix, Inc. · US

Inventors

Richard Mammone
Xiaoyu Zhang · Suzhou, CN

Key dates

Filing date	Jul 17, 1996
Grant date	Mar 14, 2000
Priority date	—
Expiry date	Jul 17, 2016

Classification

Technology area (CPC G)Physics
CPC primaryG10L21/0216
WIPO fieldComputer technology
WIPO sectorElectrical engineering

Abstract

The present invention relates to a robust speech processing method and system which models channel and noise variations with affine transforms to reduce mismatched conditions between training and testing. The affine transform relating the training vectors C.sub.k with the vectors for testing condition c.sub.k', is represented by the form: EQU c'.sub.k.sup.T =Ac.sub.k.sup.T +b for k=1 to N in which A is a matrix of predicator coefficients representing noise distortions and vector b represents channel distortions. Alternatively, an affine invariant cepstrum is generated during testing and training for modeling speech to account for noise and channel effects. From the improved speech processing, improved speaker recognition with channel and noise variations is obtained.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.