Patent · US Active

Multi-channel speech separation

US10839822B2 · kind B2 · utility

13Cited by
4References
20Claims
0Family size

Assignee

Inventors

Key dates

Filing dateNov 6, 2017
Grant dateNov 17, 2020
Priority date
Expiry dateNov 6, 2037

Classification

  • Technology area (CPC H)Electricity
  • CPC primaryH04R2430/20
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

Representative embodiments disclose mechanisms to separate and recognize multiple audio sources (e.g., picking out individual speakers) in an environment where they overlap and interfere with each other. The architecture uses a microphone array to spatially separate out the audio signals. The spatially filtered signals are then input into a plurality of separators, so each signal is input into a corresponding signal. The separators use neural networks to separate out audio sources. The separators typically produce multiple output signals for the single input signals. A post selection processor then assesses the separator outputs to pick the signals with the highest quality output. These signals can be used in a variety of systems such as speech recognition, meeting transcription and enhancement, hearing aids, music information retrieval, speech enhancement and so forth.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.