Multi-channel speech separation
US10839822B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Nov 6, 2017 |
| Grant date | Nov 17, 2020 |
| Priority date | — |
| Expiry date | Nov 6, 2037 |
Classification
- Technology area (CPC H): Electricity
- CPC primary: H04R2430/20
- WIPO field: Computer technology
- WIPO sector: Electrical engineering
Abstract
Representative embodiments disclose mechanisms to separate and recognize multiple audio sources (e.g., picking out individual speakers) in an environment where they overlap and interfere with each other. The architecture uses a microphone array to spatially separate the audio signals. The spatially filtered signals are then input into a plurality of separators, so that each signal is input into a corresponding separator. The separators use neural networks to separate out audio sources, and each separator typically produces multiple output signals for its single input signal. A post-selection processor then assesses the separator outputs and picks the signals with the highest quality. These signals can be used in a variety of systems such as speech recognition, meeting transcription and enhancement, hearing aids, music information retrieval, speech enhancement and so forth.
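The three stages named in the abstract (spatial filtering, per-beam separation, post selection) can be pictured with a minimal sketch. Everything below is an illustrative assumption rather than the patented method: the delay-and-sum beamformer, the placeholder `separate` function standing in for a neural network, the energy-based `quality` score, and all function and parameter names are hypothetical.

```python
import numpy as np

def beamform(mics, delays):
    """Delay-and-sum beamformer (assumed here; the abstract names no specific one)."""
    # mics: array of shape (n_mics, n_samples); delays: one integer sample shift per mic
    shifted = [np.roll(channel, -int(d)) for channel, d in zip(mics, delays)]
    return np.mean(shifted, axis=0)

def separate(beam, n_outputs=2):
    """Hypothetical stand-in for a neural separator: several outputs per input."""
    # A real separator would estimate one signal per source; this only scales the input.
    gains = np.linspace(0.2, 0.8, n_outputs)
    return [g * beam for g in gains]

def quality(signal):
    """Hypothetical quality score: mean energy of the signal."""
    return float(np.mean(signal ** 2))

def post_select(candidates, n_keep):
    """Keep the n_keep candidates with the highest quality score."""
    return sorted(candidates, key=quality, reverse=True)[:n_keep]

def pipeline(mics, steering_delays, n_keep=2):
    # One beam per steering direction, one separator per beam.
    beams = [beamform(mics, delays) for delays in steering_delays]
    candidates = [out for beam in beams for out in separate(beam)]
    return post_select(candidates, n_keep)
```

For example, calling `pipeline` on a four-microphone recording with three sets of steering delays yields six candidate signals, from which the two highest-scoring are returned.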
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.