Systems and methods for automatic extraction and alignment of labels derived from camera feed for moving sound sources recorded with a microphone array
US11830239B1 · kind B1 · utility
Assignee
Inventors
Key dates
| Filing date | Jul 13, 2022 |
| Grant date | Nov 28, 2023 |
| Priority date | — |
| Expiry date | Jul 13, 2042 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG06T2207/10016
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
A method for labeling audio data includes receiving video stream data and audio stream data that corresponds to at least a portion of the video stream data. The method also includes labeling, at least some objects of the video stream data. The method also includes calculating at least one offset value for at least a portion of the audio stream data that corresponds to at least one labeled object of the video stream data. The method also includes synchronizing at least a portion of the video stream data with the portion of the audio stream data. The method also includes labeling at least the portion of the audio stream data that corresponds to the at least one labeled object of the video stream data and generating training data using at least some of the labeled portion of the audio stream data.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.