Patent · US Active

System and method of speaker reidentification in a multiple camera setting conference room

US11800057B2 · kind B2 · utility

0Cited by

3References

20Claims

0Family size

Assignee

Plantronics, Inc. · US

Inventors

Yong Yan · Leander, US
Kui Zhang · Beijing, CN
David Young · Austin, US

Key dates

Filing date	Dec 31, 2021
Grant date	Oct 24, 2023
Priority date	—
Expiry date	Feb 6, 2042

Classification

Technology area (CPC H)Electricity
CPC primaryH04S7/303
WIPO fieldAudio-visual technology
WIPO sectorElectrical engineering

Abstract

In a multi-camera videoconferencing configuration, the locations of each camera are known. By referencing a known object visible to each camera, a 3D coordinate system is developed, with the position and angle of each camera being associated with that 3D coordinate system. The locations of the conference participants in the 3D coordinate system are determined for each camera. Sound source localization (SSL) from one camera, generally a central camera, is used to determine the speaker. The pose of the speaker is then determined. From the pose and the known locations of the cameras, the camera with the best frontal view of the speaker is determined. The 3D coordinates of the speaker are then used to direct the determined camera to frame the speaker. If the face of the speaker is not sufficiently visible, the next best camera view is determined, and the speaker framed from that camera view.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.