Patent · US Active

Low-latency speech separation

US10856076B2 · kind B2 · utility

0Cited by

0References

18Claims

0Family size

Assignee

MICROSOFT TECHNOLOGY LICENSING, LLC · US

Inventors

Zhuo Chen · Markham, CA
Changliang Liu · Bothell, US
Takuya Yoshioka · Bellevue, US
Xiong XIAO · Bothell, US
Hakan Erdogan · Belmont, US
Dimitrios Dimitriadis · Rutherford, US

Key dates

Filing date	Apr 5, 2019
Grant date	Dec 1, 2020
Priority date	—
Expiry date	May 29, 2039

Classification

Technology area (CPC G)Physics
CPC primaryG10L2021/02166
WIPO fieldComputer technology
WIPO sectorElectrical engineering

Abstract

A system and method include reception of a first plurality of audio signals, generation of a second plurality of beamformed audio signals based on the first plurality of audio signals, each of the second plurality of beamformed audio signals associated with a respective one of a second plurality of beamformer directions, generation of a first TF mask for a first output channel based on the first plurality of audio signals, determination of a first beamformer direction associated with a first target sound source based on the first TF mask, generation of first features based on the first beamformer direction and the first plurality of audio signals, determination of a second TF mask based on the first features, and application of the second TF mask to one of the second plurality of beamformed audio signals associated with the first beamformer direction.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.