Noise suppression for speech processing based on machine-learning mask estimation
US9640194B1 · kind B1 · utility
Assignee
Inventors
Key dates
| Filing date | Oct 4, 2013 |
| Grant date | May 2, 2017 |
| Priority date | — |
| Expiry date | Aug 19, 2034 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG10L2021/02165
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
Described are noise suppression techniques applicable to various systems including automatic speech processing systems in digital audio pre-processing. The noise suppression techniques utilize a machine-learning framework trained on cues pertaining to reference clean and noisy speech signals, and a corresponding synthetic noisy speech signal combining the clean and noisy speech signals. The machine-learning technique is further used to process audio signals in real time by extracting and analyzing cues pertaining to noisy speech to dynamically generate an appropriate gain mask, which may eliminate the noise components from the input audio signal. The audio signal pre-processed in such a manner may be applied to an automatic speech processing engine for corresponding interpretation or processing. The machine-learning technique may enable extraction of cues associated with clean automatic speech processing features, which may be used by the automatic speech processing engine for various automatic speech processing.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.