Low-latency speech separation
US10856076B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Apr 5, 2019 |
| Grant date | Dec 1, 2020 |
| Priority date | — |
| Expiry date | May 29, 2039 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG10L2021/02166
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
A system and method include reception of a first plurality of audio signals, generation of a second plurality of beamformed audio signals based on the first plurality of audio signals, each of the second plurality of beamformed audio signals associated with a respective one of a second plurality of beamformer directions, generation of a first TF mask for a first output channel based on the first plurality of audio signals, determination of a first beamformer direction associated with a first target sound source based on the first TF mask, generation of first features based on the first beamformer direction and the first plurality of audio signals, determination of a second TF mask based on the first features, and application of the second TF mask to one of the second plurality of beamformed audio signals associated with the first beamformer direction.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.