Patent · US Active

Methods, apparatus, and systems for detection and extraction of spatially-identifiable subband audio sources

US12334098B2 · kind B2 · utility

0Cited by
6References
19Claims
0Family size

Assignees

Inventors

Key dates

Filing dateJun 11, 2021
Grant dateJun 17, 2025
Priority date
Expiry dateJan 31, 2042

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG10L21/0272
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

In an embodiment, a method comprises: transforming one or more frames of a two-channel time domain audio signal into a time-frequency domain representation including a plurality of time-frequency tiles, wherein the frequency domain of the time-frequency domain representation includes a plurality of frequency bins grouped into subbands. For each time-frequency tile, the method comprises: calculating spatial parameters and a level for the time-frequency tile; modifying the spatial parameters using shift and squeeze parameters; obtaining a softmask value for each frequency bin using the modified spatial parameters, the level and subband information; and applying the softmask values to the time-frequency tile to generate a modified time-frequency tile of an estimated audio source. In an embodiment, a plurality of frames of the time-frequency tiles are assembled into a plurality of chunks, wherein each chunk includes a plurality of subbands, and the method described above is performed on each subband of each chunk.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.