Patent · US Active

Multi-channel speech separation

US10839822B2 · kind B2 · utility

13 Cited by

4 References

20 Claims

0 Family size

Assignee

MICROSOFT TECHNOLOGY LICENSING, LLC · US

Inventors

Zhuo Chen · Markham, CA
Jinyu Li · Beijing, CN
Xiong Xiao · Bothell, US
Takuya Yoshioka · Bellevue, US
Huaming Wang · Qingdao, CN
Zhenghao Wang · Redmond, US
Yifan Gong · Sammamish, US

Key dates

Filing date	Nov 6, 2017
Grant date	Nov 17, 2020
Priority date	—
Expiry date	Nov 6, 2037

Classification

Technology area (CPC H)	Electricity
CPC primary	H04R2430/20
WIPO field	Computer technology
WIPO sector	Electrical engineering

Abstract

Representative embodiments disclose mechanisms to separate and recognize multiple audio sources (e.g., picking out individual speakers) in an environment where they overlap and interfere with each other. The architecture uses a microphone array to spatially separate the audio signals. The spatially filtered signals are then input into a plurality of separators, so that each signal is input into a corresponding separator. The separators use neural networks to separate out audio sources and typically produce multiple output signals for a single input signal. A post-selection processor then assesses the separator outputs and picks the signals with the highest quality. These signals can be used in a variety of systems such as speech recognition, meeting transcription and enhancement, hearing aids, music information retrieval, speech enhancement and so forth.
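
The three stages named in the abstract (spatial filtering, one separator per filtered signal, post selection) can be illustrated with a minimal sketch. Everything below is an assumption made for illustration and is not taken from the patent: the delay-and-sum beamformer, the `toy_separator` placeholder that stands in for a neural network, the signal-energy quality score, and all function names are hypothetical.

```python
from typing import Callable, List, Sequence

Signal = List[float]


def delay_and_sum(channels: Sequence[Signal], delays: Sequence[int]) -> Signal:
    """Assumed spatial filter: delay each microphone channel by an
    integer number of samples, then average across channels."""
    n = len(channels[0])
    out = [0.0] * n
    for channel, delay in zip(channels, delays):
        for t in range(n):
            src = t - delay
            if 0 <= src < n:
                out[t] += channel[src]
    return [v / len(channels) for v in out]


def toy_separator(beam: Signal) -> List[Signal]:
    """Placeholder for a neural-network separator (hypothetical).
    It returns two outputs per input: the positive part and the
    negative part of the signal. A real separator would be a
    trained model."""
    positive = [max(v, 0.0) for v in beam]
    negative = [min(v, 0.0) for v in beam]
    return [positive, negative]


def energy(signal: Signal) -> float:
    """Assumed quality score: total signal energy."""
    return sum(v * v for v in signal)


def post_select(candidates: List[Signal], num_sources: int,
                quality: Callable[[Signal], float] = energy) -> List[Signal]:
    """Keep the num_sources candidates with the highest quality score."""
    return sorted(candidates, key=quality, reverse=True)[:num_sources]


def separate(channels: Sequence[Signal],
             beam_delays: Sequence[Sequence[int]],
             separator: Callable[[Signal], List[Signal]],
             num_sources: int) -> List[Signal]:
    """Run the three stages: beamform, separate each beam, post-select."""
    beams = [delay_and_sum(channels, delays) for delays in beam_delays]
    candidates = [out for beam in beams for out in separator(beam)]
    return post_select(candidates, num_sources)


# Two microphones, two steering directions, two sources to keep.
mics = [[1.0, 0.0, -2.0, 0.0], [0.0, 1.0, 0.0, -2.0]]
selected = separate(mics, [(0, 0), (1, 0)], toy_separator, num_sources=2)
```

Passing the separator and the quality function as parameters keeps the sketch agnostic about the model and the selection criterion, neither of which the abstract specifies.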

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.