Methods and apparatus for audio-visual speech detection and recognition
US6594629B1 · kind B1 · utility
Assignee
Inventors
Key dates
| Filing date | Aug 6, 1999 |
| Grant date | Jul 15, 2003 |
| Priority date | — |
| Expiry date | Aug 6, 2019 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG10L25/78
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
In a first aspect of the invention, methods and apparatus for providing speech recognition comprise the steps of processing a video signal associated with an arbitrary content video source, processing an audio signal associated with the video signal, and decoding the processed audio signal in conjunction with the processed video signal to generate a decoded output signal representative of the audio signal. In a second aspect 6f the invention, methods and apparatus for providing speech detection in accordance with a speech recognition system comprise the steps of processing a video signal associated with a video source to detect whether one or more features associated with the video signal are representative of speech, and processing an audio signal associated with the video signal in accordance with the speech recognition system to generate a decoded output signal representative of the audio signal when the one or more features associated with the video signal are representative of speech. Speech detection may also be performed using information from both the video path and the audio path simultaneously.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.