Patent · US Active

Multi-tap minimum variance distortionless response beamformer with neural networks for target speech separation

US11423906B2 · kind B2 · utility

1Cited by
1References
14Claims
0Family size

Assignee

Inventors

Key dates

Filing dateJul 10, 2020
Grant dateAug 23, 2022
Priority date
Expiry dateAug 27, 2040

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG10L2021/02166
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

A method, computer system, and computer readable medium are provided for automatic speech recognition. Video data and audio data corresponding to one or more speakers is received. A minimum variance distortionless response function is applied to the received audio and video data. A predicted target waveform corresponding to a target speaker from among the one or more speakers is generated based on back-propagating the output of the applied minimum variance distortionless response function.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.