Identifying far-end sound
US8219387B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Dec 10, 2007 |
| Grant date | Jul 10, 2012 |
| Priority date | — |
| Expiry date | May 8, 2031 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG10L2021/02166
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
Frames containing audio data may be received, the audio data having been derived from a microphone array, at least some of the frames containing residual acoustic echo after having acoustic echo partially removed therefrom. Probability distribution functions are determined from the frames of audio data. A probability distribution function comprises likelihoods that respective directions are directions of sources of sounds. An active speaker may be identified in frames of video data based on the video data and based on audio information derived from the audio data, where use of the audio information as a basis for identifying the active speaker is controlled by determining whether the probability distribution functions indicate that corresponding audio data includes residual acoustic echo.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.