Patent · US Active

Systems and methods for automatic extraction and alignment of labels derived from camera feed for moving sound sources recorded with a microphone array

US11830239B1 · kind B1 · utility

0Cited by
0References
20Claims
0Family size

Assignee

Inventors

Key dates

Filing dateJul 13, 2022
Grant dateNov 28, 2023
Priority date
Expiry dateJul 13, 2042

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06T2207/10016
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

A method for labeling audio data includes receiving video stream data and audio stream data that corresponds to at least a portion of the video stream data. The method also includes labeling, at least some objects of the video stream data. The method also includes calculating at least one offset value for at least a portion of the audio stream data that corresponds to at least one labeled object of the video stream data. The method also includes synchronizing at least a portion of the video stream data with the portion of the audio stream data. The method also includes labeling at least the portion of the audio stream data that corresponds to the at least one labeled object of the video stream data and generating training data using at least some of the labeled portion of the audio stream data.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.