System and method of speaker reidentification in a multiple camera setting conference room
US11800057B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Dec 31, 2021 |
| Grant date | Oct 24, 2023 |
| Priority date | — |
| Expiry date | Feb 6, 2042 |
Classification
- Technology area (CPC H)Electricity
- CPC primaryH04S7/303
- WIPO fieldAudio-visual technology
- WIPO sectorElectrical engineering
Abstract
In a multi-camera videoconferencing configuration, the locations of each camera are known. By referencing a known object visible to each camera, a 3D coordinate system is developed, with the position and angle of each camera being associated with that 3D coordinate system. The locations of the conference participants in the 3D coordinate system are determined for each camera. Sound source localization (SSL) from one camera, generally a central camera, is used to determine the speaker. The pose of the speaker is then determined. From the pose and the known locations of the cameras, the camera with the best frontal view of the speaker is determined. The 3D coordinates of the speaker are then used to direct the determined camera to frame the speaker. If the face of the speaker is not sufficiently visible, the next best camera view is determined, and the speaker framed from that camera view.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.