Patent · US Expired

Methods and apparatus for audio-visual speaker recognition and utterance verification

US6219640A · kind A · utility

149Cited by

7References

61Claims

0Family size

Assignee

International Business Machines Corporation · US

Inventors

Sankar Basu · Tenafly, US
Homayoon Beigi · Yorktown Heights, US
Stephane H. Maes · Fremont, US
Benoit Emmanuel Ghislain Maison · White Plains, US
Chalapathy Neti · Yorktown Heights, US
Andrew W. Senior · New York, US

Key dates

Filing date	Aug 6, 1999
Grant date	Apr 17, 2001
Priority date	—
Expiry date	Aug 6, 2019

Classification

Technology area (CPC G)Physics
CPC primaryG10L2015/226
WIPO fieldComputer technology
WIPO sectorElectrical engineering

Abstract

Methods and apparatus for performing speaker recognition comprise processing a video signal associated with an arbitrary content video source and processing an audio signal associated with the video signal. Then, an identification and/or verification decision is made based on the processed audio signal and the processed video signal. Various decision making embodiments may be employed including, but not limited to, a score combination approach, a feature combination approach, and a re-scoring approach. In another aspect of the invention, a method of verifying a speech utterance comprises processing a video signal associated with a video source and processing an audio signal associated with the video signal. Then, the processed audio signal is compared with the processed video signal to determine a level of correlation between the signals. This is referred to as unsupervised utterance verification. In a supervised utterance verification embodiment, the processed video signal is compared with a script representing an audio signal associated with the video signal to determine a level of correlation between the signals.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.