Perceptual optimization of magnitude and phase for time-frequency and softmask source separation systems
US12382234B2 · kind B2 · utility
Assignees
Inventors
Key dates
| Filing date | Jun 10, 2021 |
| Grant date | Aug 5, 2025 |
| Priority date | — |
| Expiry date | Apr 2, 2042 |
Classification
- Technology area (CPC H)Electricity
- CPC primaryH04S2400/11
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
A method comprises: obtaining softmask values for frequency bins of time-frequency tiles representing an audio signal; reducing, or expanding and limiting, the softmask values; and applying the reduced, or expanded and limited, softmask values to the frequency bins to create a time-frequency representation of an estimated target source. An alternative method comprises, for each time-frequency tile: obtaining softmask values; applying the softmask values to the frequency bins to create a time-frequency domain representation of an estimated target source; obtaining a panning parameter and a source concentration estimates for the target source; determining, using the panning parameter estimate and the softmask values, a magnitude for the time-frequency representation of the estimated target source; determining, using the panning parameter estimate and the source phase concentration estimate, a phase for the time-frequency representation of the estimated target source; and combining the magnitude and the phase.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.