Patent · US Expired

Methods and apparatus for audio-visual speech detection and recognition

US6594629B1 · kind B1 · utility

100Cited by

12References

21Claims

0Family size

Assignee

International Business Machines Corporation · US

Inventors

Sankar Basu · Tenafly, US
Philippe De Cuetos · Allauch, FR
Stephane H. Maes · Fremont, US
Chalapathy Neti · Yorktown Heights, US
Andrew W. Senior · New York, US

Key dates

Filing date	Aug 6, 1999
Grant date	Jul 15, 2003
Priority date	—
Expiry date	Aug 6, 2019

Classification

Technology area (CPC G)Physics
CPC primaryG10L25/78
WIPO fieldComputer technology
WIPO sectorElectrical engineering

Abstract

In a first aspect of the invention, methods and apparatus for providing speech recognition comprise the steps of processing a video signal associated with an arbitrary content video source, processing an audio signal associated with the video signal, and decoding the processed audio signal in conjunction with the processed video signal to generate a decoded output signal representative of the audio signal. In a second aspect 6f the invention, methods and apparatus for providing speech detection in accordance with a speech recognition system comprise the steps of processing a video signal associated with a video source to detect whether one or more features associated with the video signal are representative of speech, and processing an audio signal associated with the video signal in accordance with the speech recognition system to generate a decoded output signal representative of the audio signal when the one or more features associated with the video signal are representative of speech. Speech detection may also be performed using information from both the video path and the audio path simultaneously.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.