Patent · US Active

Multi-tap minimum variance distortionless response beamformer with neural networks for target speech separation

US11423906B2 · kind B2 · utility

Cited by	1

References	1

Claims	14

Family size	0

Assignee

TENCENT AMERICA LLC · US

Inventors

Yong Xu · Brooklyn, US
Meng Yu · Bellevue, US
Shi-Xiong Zhang · Redmond, US
Chao Weng · Fremont, US
Jianming Liu · Markham, CA
Dong Yu · Bellevue, US

Key dates

Filing date	Jul 10, 2020
Grant date	Aug 23, 2022
Priority date	—
Expiry date	Aug 27, 2040

Classification

Technology area (CPC G)	Physics
CPC primary	G10L2021/02166
WIPO field	Computer technology
WIPO sector	Electrical engineering

Abstract

A method, computer system, and computer readable medium are provided for automatic speech recognition. Video data and audio data corresponding to one or more speakers are received. A minimum variance distortionless response function is applied to the received audio and video data. A predicted target waveform corresponding to a target speaker from among the one or more speakers is generated based on back-propagating the output of the applied minimum variance distortionless response function.
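
For readers unfamiliar with the term, the following is a minimal sketch of a conventional, single-tap minimum variance distortionless response (MVDR) beamformer for one frequency bin, written with NumPy. It is not the claimed multi-tap, neural-network-based method, which this record does not describe in detail. The function names `mvdr_weights` and `apply_beamformer`, the array shapes, and the assumption that the noise covariance matrix and steering vector are already known are all illustrative.

```python
import numpy as np


def mvdr_weights(noise_cov, steering):
    """Textbook MVDR weights for a single frequency bin.

    noise_cov: (channels, channels) Hermitian noise covariance matrix (assumed given).
    steering:  (channels,) steering vector toward the target (assumed given).
    """
    # Solve noise_cov @ x = steering instead of forming an explicit inverse.
    x = np.linalg.solve(noise_cov, steering)
    # Normalise so that w^H d = 1, the distortionless constraint.
    return x / (steering.conj() @ x)


def apply_beamformer(weights, mixture):
    """Apply w^H y to a (channels, frames) multichannel signal for one bin."""
    return weights.conj() @ mixture


if __name__ == "__main__":
    # Hypothetical 4-microphone example with synthetic data.
    rng = np.random.default_rng(0)
    channels, frames = 4, 200
    d = np.exp(-1j * np.pi * 0.3 * np.arange(channels))  # illustrative steering vector
    noise = rng.standard_normal((channels, frames)) + 1j * rng.standard_normal((channels, frames))
    target = rng.standard_normal(frames) + 1j * rng.standard_normal(frames)

    noise_cov = noise @ noise.conj().T / frames
    w = mvdr_weights(noise_cov, d)

    # The target component passes through with unit gain; noise is attenuated.
    enhanced = apply_beamformer(w, d[:, None] * target + noise)
    print("gain toward target:", abs(w.conj() @ d))
    print("residual noise power:", np.mean(abs(enhanced - target) ** 2))
```

The weights minimise output noise power subject to unit gain in the target direction, which is what "minimum variance" and "distortionless" refer to. In a system like the one the abstract outlines, the beamformer output would additionally need to be differentiable so that it can be trained by back-propagation; that part is outside this sketch.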

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.