Patent · US Active

Directional speech separation

US10755727B1 · kind B1 · utility

Cited by	4
References	0
Claims	20
Family size	0

Assignee

AMAZON TECHNOLOGIES, INC. · US

Inventor

Wai Chung Chu · San Jose, US

Key dates

Filing date	Sep 25, 2018
Grant date	Aug 25, 2020
Priority date	—
Expiry date	Mar 2, 2039

Classification

Technology area (CPC H)	Electricity
CPC primary	H04R2430/20
WIPO field	Computer technology
WIPO sector	Electrical engineering

Abstract

A system is configured to perform directional speech separation. The system may dynamically associate directions of arrival with one or more audio sources in order to generate output audio data that separates each of the audio sources. The system identifies a target direction for each audio source, dynamically determines directions that are correlated with the target direction, and generates output signals for each audio source. The system may associate individual frequency bands with specific directions based on a time delay detected by two or more microphones. The system may determine a cross-correlation between each direction and the target direction and select directions with strong correlation. The system may generate time-frequency mask data indicating frequency bands corresponding to the directions associated with a particular audio source. Using the mask data, the system generates output audio data specific to the audio source, resulting in directional speech separation between different audio sources.
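
The steps named in the abstract can be illustrated with a minimal two-microphone sketch. It is not the patented implementation. It assumes the inputs are complex short-time Fourier transforms of the two microphone signals, that the microphones are close enough to avoid phase wrapping, that "cross-correlation" means a zero-lag normalized correlation of per-frame energy, and that a binary mask is used. The function names, the 0.5 threshold, and the nine direction buckets are hypothetical choices made for illustration.

```python
import numpy as np

SPEED_OF_SOUND = 343.0  # m/s, assumed propagation speed


def direction_index(X1, X2, fs, frame_len, mic_distance, n_directions):
    """Map each time-frequency bin to a direction bucket using the inter-mic delay.

    X1, X2: complex STFTs of the two microphones, shape (frames, bins).
    Returns an integer array of the same shape with values in [0, n_directions).
    """
    freqs = np.fft.rfftfreq(frame_len, d=1.0 / fs)   # bin centre frequencies in Hz
    phase_diff = np.angle(X1 * np.conj(X2))          # radians, in (-pi, pi]
    delay = np.zeros(phase_diff.shape)
    # Skip the DC bin (f = 0), where a phase difference carries no delay information.
    delay[:, 1:] = phase_diff[:, 1:] / (2.0 * np.pi * freqs[1:])
    max_delay = mic_distance / SPEED_OF_SOUND        # delay of an end-fire source
    angle = np.arccos(np.clip(delay / max_delay, -1.0, 1.0))  # 0..pi radians
    idx = (angle / np.pi * n_directions).astype(int)
    return np.minimum(idx, n_directions - 1)


def directional_mask(idx, power, target, n_directions, threshold=0.5):
    """Pick directions whose energy tracks the target direction; build a binary mask."""
    # Per-frame energy arriving from each direction, shape (frames, n_directions).
    energy = np.stack(
        [np.where(idx == d, power, 0.0).sum(axis=1) for d in range(n_directions)],
        axis=1,
    )
    # Zero-lag normalized correlation of each direction's energy with the target's
    # (an assumed reading of "cross-correlation").
    centered = energy - energy.mean(axis=0)
    norms = np.linalg.norm(centered, axis=0)
    corr = np.zeros(n_directions)
    ok = norms > 0
    if ok[target]:
        corr[ok] = centered[:, ok].T @ centered[:, target] / (norms[ok] * norms[target])
    selected = corr >= threshold
    selected[target] = True                          # the target is always kept
    return selected[idx], selected


def separate(X1, X2, fs, frame_len, mic_distance, target, n_directions=9):
    """Return the STFT of microphone 1 with only the target source's bins kept."""
    idx = direction_index(X1, X2, fs, frame_len, mic_distance, n_directions)
    mask, _ = directional_mask(idx, np.abs(X1) ** 2, target, n_directions)
    return mask * X1
```

Calling `separate` once per target direction gives one masked spectrogram per audio source. An inverse STFT, not shown here, would turn each one back into a waveform.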

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.