Patent · US Expired

Method and apparatus for predicting events in video conferencing and other applications

US6894714B2 · kind B2 · utility

86Cited by

12References

19Claims

0Family size

Assignee

KONINKLIJKE PHILIPS ELECTRONICS NV · NL

Inventors

Srinivas Gutta · Buchanan, US
Hugo J. Strubbe · Yorktown Heights, US
Antonio Colmenarez · Peekskill, US

Key dates

Filing date	Dec 5, 2000
Grant date	May 17, 2005
Priority date	—
Expiry date	Feb 21, 2022

Classification

Technology area (CPC H)Electricity
CPC primaryH04N7/15
WIPO fieldAudio-visual technology
WIPO sectorElectrical engineering

Abstract

Methods and apparatus are disclosed for predicting events using acoustic and visual cues. The present invention processes audio and video information to identify one or more (i) acoustic cues, such as intonation patterns, pitch and loudness, (ii) visual cues, such as gaze, facial pose, body postures, hand gestures and facial expressions, or (iii) a combination of the foregoing, that are typically associated with an event, such as behavior exhibited by a video conference participant before he or she speaks. In this manner, the present invention allows the video processing system to predict events, such as the identity of the next speaker. The predictive speaker identifier operates in a learning mode to learn the characteristic profile of each participant in terms of the concept that the participant “will speak” or “will not speak” under the presence or absence of one or more predefined visual or acoustic cues. The predictive speaker identifier operates in a predictive mode to compare the learned characteristics embodied in the characteristic profile to the audio and video information and thereby predict the next speaker.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.