Patent · US Active

System and method of speaker reidentification in a multiple camera setting conference room

US11800057B2 · kind B2 · utility

0Cited by
3References
20Claims
0Family size

Assignee

Inventors

Key dates

Filing dateDec 31, 2021
Grant dateOct 24, 2023
Priority date
Expiry dateFeb 6, 2042

Classification

  • Technology area (CPC H)Electricity
  • CPC primaryH04S7/303
  • WIPO fieldAudio-visual technology
  • WIPO sectorElectrical engineering

Abstract

In a multi-camera videoconferencing configuration, the locations of each camera are known. By referencing a known object visible to each camera, a 3D coordinate system is developed, with the position and angle of each camera being associated with that 3D coordinate system. The locations of the conference participants in the 3D coordinate system are determined for each camera. Sound source localization (SSL) from one camera, generally a central camera, is used to determine the speaker. The pose of the speaker is then determined. From the pose and the known locations of the cameras, the camera with the best frontal view of the speaker is determined. The 3D coordinates of the speaker are then used to direct the determined camera to frame the speaker. If the face of the speaker is not sufficiently visible, the next best camera view is determined, and the speaker framed from that camera view.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.