Matching active speaker pose between two cameras
US11985417B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Jun 16, 2022 |
| Grant date | May 14, 2024 |
| Priority date | — |
| Expiry date | Sep 23, 2042 |
Classification
- Technology area (CPC H)Electricity
- CPC primaryH04R2430/23
- WIPO fieldAudio-visual technology
- WIPO sectorElectrical engineering
Abstract
Described are multiple cameras in a conference room, each pointed in a different direction. A primary camera includes a microphone array to perform sound source localization (SSL). The SSL is used in combination with a video image to identify the speaker from among multiple individuals that appear in the video image. Pose information of the speaker is developed. Pose information of each individual identified in each other camera is developed. The speaker pose information is compared to the pose information of the individuals from the other cameras. The best match for each other camera is selected as the speaker in that camera. The speaker views of each camera are compared to determine the speaker view with the most frontal view of the speaker. That camera is selected to provide the video for provision to the far end.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.