Patent · US Active

Auto focus on speaker during multi-participant communication conferencing

US12192669B2 · kind B2 · utility

0Cited by
6References
20Claims
0Family size

Assignee

Inventors

Key dates

Filing dateJun 8, 2022
Grant dateJan 7, 2025
Priority date
Expiry dateAug 30, 2042

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG10L17/00
  • WIPO fieldAudio-visual technology
  • WIPO sectorElectrical engineering

Abstract

A method for auto focus on a speaker during a communication session includes receiving video captured of a scene that includes a plurality of images of participants to the communication session, identifying the plurality of images of the participants in the video captured of the scene, recognizing audio from at least one of the participants, detecting facial movement in one of the images of the plurality of images and equating the recognized audio to the detected movement in the one of the images of the plurality of images. The method also includes selecting the one of the images of the plurality of images as a speaker based on the equated recognized audio to the detected movement in the one of the images, zooming in on the speaker and filtering out a remainder of the images of the plurality of images.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.