Methods and apparatus for audio-visual speaker recognition and utterance verification
US6219640A · kind A · utility
Assignee
Inventors
Key dates
| Filing date | Aug 6, 1999 |
| Grant date | Apr 17, 2001 |
| Priority date | — |
| Expiry date | Aug 6, 2019 |
Classification
- Technology area (CPC G): Physics
- CPC primary: G10L2015/226
- WIPO field: Computer technology
- WIPO sector: Electrical engineering
Abstract
Methods and apparatus for performing speaker recognition comprise processing a video signal associated with an arbitrary content video source and processing an audio signal associated with the video signal. Then, an identification and/or verification decision is made based on the processed audio signal and the processed video signal. Various decision making embodiments may be employed including, but not limited to, a score combination approach, a feature combination approach, and a re-scoring approach. In another aspect of the invention, a method of verifying a speech utterance comprises processing a video signal associated with a video source and processing an audio signal associated with the video signal. Then, the processed audio signal is compared with the processed video signal to determine a level of correlation between the signals. This is referred to as unsupervised utterance verification. In a supervised utterance verification embodiment, the processed video signal is compared with a script representing an audio signal associated with the video signal to determine a level of correlation between the signals.
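The score combination approach named in the abstract can be illustrated with a minimal sketch. The abstract does not say how scores are combined, so the weighted sum below is an assumption, as are the function names (`combine_scores`, `identify`, `verify`), the `audio_weight` parameter, and the example scores. It assumes each modality has already produced one similarity score per enrolled speaker, on comparable scales.

```python
from typing import Dict


def combine_scores(
    audio_scores: Dict[str, float],
    video_scores: Dict[str, float],
    audio_weight: float = 0.5,
) -> Dict[str, float]:
    """Fuse per-speaker scores from the two modalities with a weighted sum.

    Hypothetical helper: the weighting scheme is an illustrative assumption,
    not the method claimed in the patent.
    """
    if not 0.0 <= audio_weight <= 1.0:
        raise ValueError("audio_weight must lie in [0, 1]")
    # Only speakers scored by both modalities can be fused.
    common = audio_scores.keys() & video_scores.keys()
    return {
        speaker: audio_weight * audio_scores[speaker]
        + (1.0 - audio_weight) * video_scores[speaker]
        for speaker in common
    }


def identify(combined: Dict[str, float]) -> str:
    """Identification: return the enrolled speaker with the highest fused score."""
    if not combined:
        raise ValueError("no speakers to choose from")
    return max(combined, key=combined.get)


def verify(combined: Dict[str, float], claimed: str, threshold: float) -> bool:
    """Verification: accept the claimed identity if its fused score meets the threshold."""
    return combined.get(claimed, float("-inf")) >= threshold


# Made-up scores for two enrolled speakers.
audio = {"alice": 0.9, "bob": 0.4}
video = {"alice": 0.2, "bob": 0.8}

fused_equal = combine_scores(audio, video, audio_weight=0.5)
fused_audio_heavy = combine_scores(audio, video, audio_weight=0.8)
```

With equal weights the video evidence tips the identification toward `bob`, while weighting audio more heavily favors `alice`. A real system would have to choose the weight and the verification threshold from held-out data.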
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.