Audio zoom based on speaker detection using lip reading
US11250869B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Jul 28, 2020 |
| Grant date | Feb 15, 2022 |
| Priority date | — |
| Expiry date | Jul 28, 2040 |
Classification
- Technology area (CPC H): Electricity
- CPC primary: H04R2430/20
- WIPO field: Computer technology
- WIPO sector: Electrical engineering
Abstract
Disclosed are an electronic device that performs audio zoom based on speaker detection using lip reading, and a method for controlling the electronic device. According to an embodiment, the electronic device detects the direction of a sound source while recording a video and determines the speaker's direction via facial recognition and mouth shape recognition in the sound source direction. Microphone beamforming may then be performed based on the speaker's direction, which may enhance the accuracy of the audio zoom.
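The flow in the abstract (sound source direction, then face and mouth shape recognition, then beamforming toward the speaker) can be illustrated with a minimal sketch. The abstract does not say which beamforming algorithm is used, so the code below substitutes plain delay-and-sum beamforming on a linear microphone array as an assumption. The function names (`pick_speaker_angle`, `steering_delays`, `delay_and_sum`), the 20-degree tolerance, the mouth-activity score, and the array geometry are all hypothetical and are not taken from the patent.

```python
import math

SPEED_OF_SOUND = 343.0  # m/s; assumed value for air at room temperature


def pick_speaker_angle(sound_angle_deg, faces, tolerance_deg=20.0):
    """Refine a coarse sound-source angle using detected faces.

    faces is a list of (angle_deg, mouth_activity) tuples, where
    mouth_activity is a hypothetical score from a mouth shape recognizer.
    Returns the angle of the most active mouth near the sound direction,
    or the sound direction itself if no face is close enough.
    """
    nearby = [f for f in faces if abs(f[0] - sound_angle_deg) <= tolerance_deg]
    if not nearby:
        return sound_angle_deg
    return max(nearby, key=lambda f: f[1])[0]


def steering_delays(mic_positions_m, angle_deg):
    """Per-microphone delays (seconds) that align a plane wave from angle_deg.

    Assumes a linear array along the x-axis, with the angle measured from
    broadside (0 degrees is straight ahead, positive angles toward +x).
    """
    theta = math.radians(angle_deg)
    # Relative arrival time at each microphone: microphones further
    # toward the source hear the wavefront earlier.
    arrivals = [-x * math.sin(theta) / SPEED_OF_SOUND for x in mic_positions_m]
    latest = max(arrivals)
    # Delay every channel so that it lines up with the latest arrival.
    return [latest - t for t in arrivals]


def delay_and_sum(channels, delays_s, sample_rate_hz):
    """Shift each channel by its delay (rounded to whole samples) and average."""
    shifts = [round(d * sample_rate_hz) for d in delays_s]
    n = len(channels[0])
    out = [0.0] * n
    for samples, shift in zip(channels, shifts):
        for i in range(shift, n):
            out[i] += samples[i - shift]
    return [v / len(channels) for v in out]
```

A usage example under the same assumptions: with a coarse sound direction of 10 degrees and three detected faces, `pick_speaker_angle(10.0, [(-40.0, 0.9), (5.0, 0.2), (15.0, 0.7)])` selects the face at 15 degrees, because the face at -40 degrees lies outside the tolerance even though its mouth activity is higher. The returned angle would then be passed to `steering_delays` and the resulting delays to `delay_and_sum`. Rounding delays to whole samples keeps the sketch short; a real device would likely need fractional delays or a frequency-domain beamformer.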
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.