Patent · US Active

System and method for disambiguating a source of sound based on detected lip movement

US11200902B2 · kind B2 · utility

0Cited by
3References
18Claims
0Family size

Assignee

Inventors

Key dates

Filing dateFeb 15, 2019
Grant dateDec 14, 2021
Priority date
Expiry dateApr 7, 2039

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG10L21/02
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

The present teaching relates to method, system, medium, and implementations for detecting a source of speech sound in a dialogue. A visual signal acquired from a dialogue scene is first received, where the visual signal captures a person present in the dialogue scene. A human lip associated with the person is detected from the visual signal and tracked to detect whether lip movement is observed. If lip movement is detected, a first candidate source of sound is generated corresponding to an area in the dialogue scene where the lip movement occurred.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.