System and method for disambiguating a source of sound based on detected lip movement
US11200902B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Feb 15, 2019 |
| Grant date | Dec 14, 2021 |
| Priority date | — |
| Expiry date | Apr 7, 2039 |
Classification
- Technology area (CPC G): Physics
- CPC primary: G10L21/02
- WIPO field: Computer technology
- WIPO sector: Electrical engineering
Abstract
The present teaching relates to a method, system, medium, and implementations for detecting a source of speech sound in a dialogue. A visual signal acquired from a dialogue scene is first received, where the visual signal captures a person present in the dialogue scene. A human lip associated with the person is detected from the visual signal and tracked to detect whether lip movement is observed. If lip movement is detected, a first candidate source of sound is generated corresponding to an area in the dialogue scene where the lip movement occurred.
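The flow described in the abstract (track a detected lip, decide whether it moved, and if so report the scene area as a candidate sound source) can be illustrated with a minimal sketch. This is not the patented implementation: the `LipObservation` type, the use of lip aperture as the movement cue, the 2.0-pixel threshold, and the choice to report the union of tracked lip boxes as the "area" are all assumptions made for illustration. The lip detector itself is assumed to exist upstream and is not shown.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple

# (x, y, width, height) in image pixels
Box = Tuple[int, int, int, int]


@dataclass
class LipObservation:
    """Hypothetical per-frame output of an upstream lip detector."""
    box: Box         # lip region located in this frame
    aperture: float  # gap between upper and lower lip, in pixels


def lip_movement_detected(track: List[LipObservation],
                          threshold: float = 2.0) -> bool:
    """Assumed movement test: the aperture range over the track exceeds threshold."""
    if len(track) < 2:
        return False  # a single frame cannot show movement
    apertures = [obs.aperture for obs in track]
    return max(apertures) - min(apertures) > threshold


def candidate_sound_source(track: List[LipObservation],
                           threshold: float = 2.0) -> Optional[Box]:
    """Return the scene area covering the tracked lip if it moved, else None."""
    if not lip_movement_detected(track, threshold):
        return None
    # Union of all tracked lip boxes, so the area covers the whole movement.
    x0 = min(obs.box[0] for obs in track)
    y0 = min(obs.box[1] for obs in track)
    x1 = max(obs.box[0] + obs.box[2] for obs in track)
    y1 = max(obs.box[1] + obs.box[3] for obs in track)
    return (x0, y0, x1 - x0, y1 - y0)


# Usage: one person whose lips open and close, one whose lips stay still.
speaking = [
    LipObservation((100, 200, 40, 20), 3.0),
    LipObservation((102, 201, 40, 20), 9.0),
    LipObservation((101, 200, 40, 20), 4.0),
]
silent = [
    LipObservation((300, 210, 40, 20), 3.0),
    LipObservation((300, 210, 40, 20), 3.4),
]
print(candidate_sound_source(speaking))  # a box around the moving lips
print(candidate_sound_source(silent))    # None
```

The abstract calls this a "first" candidate source. In this sketch it is simply a bounding box in image coordinates, which a later stage could compare against other evidence about where the sound came from.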
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.