Methods, apparatus, and systems for detection and extraction of spatially-identifiable subband audio sources
US12334098B2 · kind B2 · utility
Assignees
Inventors
Key dates
| Filing date | Jun 11, 2021 |
| Grant date | Jun 17, 2025 |
| Priority date | — |
| Expiry date | Jan 31, 2042 |
Classification
- Technology area (CPC G): Physics
- CPC primary: G10L21/0272
- WIPO field: Computer technology
- WIPO sector: Electrical engineering
Abstract
In an embodiment, a method comprises: transforming one or more frames of a two-channel time domain audio signal into a time-frequency domain representation including a plurality of time-frequency tiles, wherein the frequency domain of the time-frequency domain representation includes a plurality of frequency bins grouped into subbands. For each time-frequency tile, the method comprises: calculating spatial parameters and a level for the time-frequency tile; modifying the spatial parameters using shift and squeeze parameters; obtaining a softmask value for each frequency bin using the modified spatial parameters, the level and subband information; and applying the softmask values to the time-frequency tile to generate a modified time-frequency tile of an estimated audio source. In an embodiment, a plurality of frames of the time-frequency tiles are assembled into a plurality of chunks, wherein each chunk includes a plurality of subbands, and the method described above is performed on each subband of each chunk.
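The per-tile steps in the abstract can be illustrated with a minimal Python sketch. The abstract gives no formulas, so everything concrete here is an assumption made for illustration: the panning and inter-channel phase-difference definitions, the affine form of the shift and squeeze modification, the Gaussian-shaped softmask, the level floor, and all names (`spatial_params`, `softmask_value`, `extract_tile`, and the keys of the `band` dictionary). The sketch also assumes the frame has already been transformed into complex frequency bins, for example by an STFT. It is not the patented method.

```python
import cmath
import math


def spatial_params(l_bin, r_bin, eps=1e-12):
    """Return (panning, phase_diff, level_db) for one frequency bin.

    Assumed definitions: panning is 0.0 for hard left, 0.5 for center and
    1.0 for hard right; phase_diff is the inter-channel phase in radians.
    """
    panning = math.atan2(abs(r_bin), abs(l_bin)) / (math.pi / 2)
    phase_diff = cmath.phase(l_bin * r_bin.conjugate())
    level_db = 10.0 * math.log10(abs(l_bin) ** 2 + abs(r_bin) ** 2 + eps)
    return panning, phase_diff, level_db


def softmask_value(panning, phase_diff, level_db, band):
    """Map modified spatial parameters and level to a mask value in [0, 1]."""
    if level_db < band["floor_db"]:
        return 0.0  # assumed: bins below the level floor are rejected
    # Assumed shift-and-squeeze form: recenter on the target, then rescale.
    pan_mod = band["squeeze_pan"] * (panning - band["shift_pan"])
    phase_mod = band["squeeze_phase"] * (phase_diff - band["shift_phase"])
    # Assumed Gaussian-shaped mask around the shifted origin.
    return math.exp(-0.5 * (pan_mod / band["pan_width"]) ** 2
                    - 0.5 * (phase_mod / band["phase_width"]) ** 2)


def extract_tile(left_bins, right_bins, subbands):
    """Apply per-bin softmask values to one two-channel time-frequency tile."""
    masks = [0.0] * len(left_bins)
    for band in subbands:  # subband information: bin range plus parameters
        for k in range(band["start"], band["stop"]):
            params = spatial_params(left_bins[k], right_bins[k])
            masks[k] = softmask_value(*params, band)
    est_left = [m * x for m, x in zip(masks, left_bins)]
    est_right = [m * x for m, x in zip(masks, right_bins)]
    return est_left, est_right, masks


# Usage: one subband targeting a center-panned, in-phase source.
band = {"start": 0, "stop": 3, "floor_db": -60.0,
        "shift_pan": 0.5, "squeeze_pan": 1.0, "pan_width": 0.1,
        "shift_phase": 0.0, "squeeze_phase": 1.0, "phase_width": 0.5}
left = [1 + 0j, 1 + 0j, 1e-5 + 0j]   # bin 0 center, bin 1 hard left, bin 2 very quiet
right = [1 + 0j, 0j, 1e-5 + 0j]
est_left, est_right, masks = extract_tile(left, right, [band])
```

In this toy input the center-panned bin is kept almost unchanged, the hard-left bin is strongly attenuated, and the very quiet bin is zeroed by the level floor. Processing chunks, as the last sentence of the abstract describes, would amount to running `extract_tile` over several frames with one parameter set per subband per chunk; how those parameters are estimated is not stated in the abstract and is not modeled here.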
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.