Patent · US Active

Audiovisual source separation and localization using generative adversarial networks

US11501532B2 · kind B2 · utility

Cited by: 6
References: 1
Claims: 25
Family size: 0

Assignee

Inventors

Key dates

Filing date: Apr 25, 2019
Grant date: Nov 15, 2022
Priority date
Expiry date: Sep 15, 2041

Classification

  • Technology area (CPC G): Physics
  • CPC primary: G06N3/084
  • WIPO field: Computer technology
  • WIPO sector: Electrical engineering

Abstract

A method (and structure and computer product) for audiovisual source separation processing includes receiving video data showing images of a plurality of sound sources into a video encoder, while concurrently receiving into the video encoder optical flow data of the video data, the optical flow data indicating motions of pixels between frames of the video data. The video encoder encodes the received video data into video localization data comprising information associating pixels in the frames of video data with different channels of sound and encodes the received optical flow data into video separation data comprising information associating motion information in the frames of video data with the different channels of sound.
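
The data flow in the abstract (frames and optical flow in, localization data and separation data out) can be illustrated with a toy sketch. This is not the patented encoder, which is a learned network. The function name `encode`, the column-split rule for assigning pixels to channels, and the use of mean flow magnitude as the per-channel motion summary are all assumptions made for illustration only.

```python
import math

def encode(frame, flow, num_channels=2):
    """Hypothetical stand-in for the video encoder.

    frame: 2D list of pixel intensities (only its shape is used here;
           a learned encoder would use the pixel values themselves).
    flow:  2D list of (dx, dy) motion vectors, one per pixel.
    Returns (localization, separation).
    """
    height, width = len(frame), len(frame[0])

    # Localization data: one sound-channel index per pixel.
    # Assumption: sources sit side by side, so columns are split evenly.
    localization = [
        [min(x * num_channels // width, num_channels - 1) for x in range(width)]
        for _ in range(height)
    ]

    # Separation data: mean optical-flow magnitude per sound channel,
    # i.e. how much motion is associated with each channel.
    totals = [0.0] * num_channels
    counts = [0] * num_channels
    for y in range(height):
        for x in range(width):
            dx, dy = flow[y][x]
            channel = localization[y][x]
            totals[channel] += math.hypot(dx, dy)
            counts[channel] += 1
    separation = [t / c if c else 0.0 for t, c in zip(totals, counts)]
    return localization, separation

# Toy input: a 2x4 frame where only the right half is moving.
frame = [[0.1, 0.2, 0.8, 0.9],
         [0.1, 0.3, 0.7, 0.9]]
flow = [[(0.0, 0.0), (0.0, 0.0), (3.0, 4.0), (3.0, 4.0)],
        [(0.0, 0.0), (0.0, 0.0), (3.0, 4.0), (3.0, 4.0)]]
localization, separation = encode(frame, flow)
```

In this toy input the left-half pixels map to channel 0 and show no motion, while the right-half pixels map to channel 1 and carry all of the motion. The point is only the shape of the two outputs: a per-pixel channel map and a per-channel motion summary.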

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.