Directional speech separation
US10755727B1 · kind B1 · utility
Assignee
Inventor
Key dates
| Filing date | Sep 25, 2018 |
| Grant date | Aug 25, 2020 |
| Priority date | — |
| Expiry date | Mar 2, 2039 |
Classification
- Technology area (CPC H): Electricity
- CPC primary: H04R2430/20
- WIPO field: Computer technology
- WIPO sector: Electrical engineering
Abstract
A system configured to perform directional speech separation. The system may dynamically associate directions of arrival with one or more audio sources in order to generate output audio data that separates each of the audio sources. The system identifies a target direction for each audio source, dynamically determines directions that are correlated with the target direction, and generates output signals for each audio source. The system may associate individual frequency bands with specific directions based on a time delay detected by two or more microphones. The system may determine a cross-correlation between each direction and the target direction and select directions with strong correlation. The system may generate time-frequency mask data indicating frequency bands corresponding to the directions associated with a particular audio source. Using the mask data, the system generates output audio data specific to the audio source, resulting in directional speech separation between different audio sources.
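The steps named in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the function names, the two-microphone far-field geometry, the 343 m/s speed of sound, the use of a normalized correlation of per-direction energy as the "cross-correlation", the 0.5 threshold, and the binary mask are all assumptions made for illustration.

```python
import cmath
import math

SPEED_OF_SOUND = 343.0  # m/s; assumed value for air at room temperature


def band_direction(x1, x2, freq_hz, mic_spacing_m, n_directions):
    """Map one frequency band to a direction index (hypothetical helper).

    x1 and x2 are the complex spectral values of the band at microphones 1
    and 2. The phase of the cross-spectrum gives the delay of microphone 2
    relative to microphone 1, which a far-field model turns into an angle.
    Only valid below the spatial-aliasing frequency c / (2 * spacing).
    """
    if freq_hz <= 0:
        raise ValueError("freq_hz must be positive")
    phase = cmath.phase(x1 * x2.conjugate())
    delay = phase / (2 * math.pi * freq_hz)
    max_delay = mic_spacing_m / SPEED_OF_SOUND
    # Clamp so that noise cannot push the ratio outside the domain of acos.
    ratio = max(-1.0, min(1.0, delay / max_delay))
    angle = math.acos(ratio)  # 0 .. pi radians
    return min(n_directions - 1, int(angle / math.pi * n_directions))


def correlation(a, b):
    """Normalized correlation between two equal-length energy sequences."""
    n = len(a)
    mean_a, mean_b = sum(a) / n, sum(b) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    if var_a == 0 or var_b == 0:
        return 0.0
    return cov / math.sqrt(var_a * var_b)


def select_directions(energy, target, threshold=0.5):
    """Pick directions whose per-frame energy tracks the target direction.

    energy[d] is the list of per-frame energies for direction d.
    """
    return {
        d
        for d, series in enumerate(energy)
        if d == target or correlation(series, energy[target]) >= threshold
    }


def build_mask(direction_map, selected):
    """Binary time-frequency mask: 1.0 where the band's direction is selected.

    direction_map[t][k] is the direction index of band k in frame t.
    """
    return [[1.0 if d in selected else 0.0 for d in frame]
            for frame in direction_map]


def apply_mask(spectrum, mask):
    """Multiply each time-frequency value by its mask entry."""
    return [[x * m for x, m in zip(frame, mask_row)]
            for frame, mask_row in zip(spectrum, mask)]
```

In use, one would compute a short-time spectrum per microphone, call `band_direction` for every band of every frame to build `direction_map`, accumulate per-direction energy over frames, and then run `select_directions`, `build_mask`, and `apply_mask` once per audio source. A practical system would likely use soft rather than binary masks and handle spatial aliasing at higher frequencies, which this sketch does not attempt.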
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.