Identifying speech portions of a sound model using various statistics thereof
US9058820B1 · kind B1 · utility
Assignee
Inventors
Key dates
| Event | Date |
| --- | --- |
| Filing date | May 21, 2013 |
| Grant date | Jun 16, 2015 |
| Priority date | — |
| Expiry date | Jan 11, 2034 |
Classification
- Technology area (CPC G): Physics
- CPC primary: G10L21/14
- WIPO field: Computer technology
- WIPO sector: Electrical engineering
Abstract
Speech portions of a sound model may be identified using various statistics associated with the sound model for voice enhancement of noisy audio signals. A spectral motion transform may be performed on an input signal to obtain a linear fit in time of a sound model of the input signal. Statistics may be extracted from the linear fit of the sound model of the input signal. Speech portions of the linear fit of the sound model of the input signal may be identified by detecting a presence of harmonics as a function of time in the linear fit of the sound model of the input signal based on individual ones of the extracted statistics. An output signal may be provided that conveys audio comprising a reconstructed speech component of the input signal with a noise component of the input signal being suppressed.
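The idea of detecting harmonics as a function of time from a per-frame statistic can be illustrated with a small sketch. This is not the patented method: the spectral motion transform and the linear fit of the sound model are not reproduced here. As a stand-in, the sketch uses the peak of the normalized autocorrelation as a single harmonicity statistic and thresholds it frame by frame. All names (`harmonicity`, `flag_harmonic_frames`, `demo_signal`), the frame length, the lag range, and the threshold are assumptions chosen for illustration.

```python
import math
import random


def harmonicity(frame, min_lag, max_lag):
    """Peak normalized autocorrelation over the candidate pitch lags.

    Stand-in statistic (an assumption, not taken from the patent):
    values near 1 suggest a periodic, harmonic frame; values near 0
    suggest noise.
    """
    energy = sum(x * x for x in frame)
    if energy == 0.0:
        return 0.0
    best = 0.0
    for lag in range(min_lag, max_lag + 1):
        acc = sum(frame[i] * frame[i + lag] for i in range(len(frame) - lag))
        best = max(best, acc / energy)
    return best


def flag_harmonic_frames(signal, frame_len, min_lag, max_lag, threshold):
    """Return one boolean per non-overlapping frame: True if harmonics are likely present."""
    flags = []
    for start in range(0, len(signal) - frame_len + 1, frame_len):
        frame = signal[start:start + frame_len]
        flags.append(harmonicity(frame, min_lag, max_lag) >= threshold)
    return flags


def demo_signal(sample_rate=8000, seed=0):
    """Synthetic test input: 0.25 s of white noise, then 0.25 s of a 200 Hz harmonic tone."""
    rng = random.Random(seed)
    n = sample_rate // 4
    noise = [rng.gauss(0.0, 1.0) for _ in range(n)]
    tone = [
        sum(math.sin(2 * math.pi * 200 * k * t / sample_rate) / k for k in (1, 2, 3))
        + rng.gauss(0.0, 0.1)  # light additive noise on the harmonic part
        for t in range(n)
    ]
    return noise + tone


if __name__ == "__main__":
    # Lags 20..100 at 8 kHz cover pitches from 80 Hz to 400 Hz.
    flags = flag_harmonic_frames(
        demo_signal(), frame_len=400, min_lag=20, max_lag=100, threshold=0.5
    )
    print(flags)
```

On the synthetic input, the noise-only frames should fall below the threshold and the harmonic frames above it. A system along the lines of the abstract would combine several such statistics and then reconstruct the speech component while suppressing the noise; neither step is attempted in this sketch.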
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.