Patent · US Expired

Methods and apparatus for audio-visual speaker recognition and utterance verification

US6219640A · kind A · utility

149Cited by
7References
61Claims
0Family size

Assignee

Inventors

Key dates

Filing dateAug 6, 1999
Grant dateApr 17, 2001
Priority date
Expiry dateAug 6, 2019

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG10L2015/226
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

Methods and apparatus for performing speaker recognition comprise processing a video signal associated with an arbitrary content video source and processing an audio signal associated with the video signal. Then, an identification and/or verification decision is made based on the processed audio signal and the processed video signal. Various decision making embodiments may be employed including, but not limited to, a score combination approach, a feature combination approach, and a re-scoring approach. In another aspect of the invention, a method of verifying a speech utterance comprises processing a video signal associated with a video source and processing an audio signal associated with the video signal. Then, the processed audio signal is compared with the processed video signal to determine a level of correlation between the signals. This is referred to as unsupervised utterance verification. In a supervised utterance verification embodiment, the processed video signal is compared with a script representing an audio signal associated with the video signal to determine a level of correlation between the signals.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.