Speech signal separation and synthesis based on auditory scene analysis and speech modeling
US9536540B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Jul 18, 2014 |
| Grant date | Jan 3, 2017 |
| Priority date | — |
| Expiry date | Aug 13, 2034 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG10L21/0208
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
Provided are systems and methods for generating clean speech from a speech signal representing a mixture of a noise and speech. The clean speech may be generated from synthetic speech parameters. The synthetic speech parameters are derived based on the speech signal components and a model of speech using auditory and speech production principles. The modeling may utilize a source-filter structure of the speech signal. One or more spectral analyzes on the speech signal are performed to generate spectral representations. The feature data is derived based on a spectral representation. The features corresponding to the target speech according to a model of speech are grouped and separated from the feature data. The synthetic speech parameters, including spectral envelope, pitch data and voice classification data are generated based on features corresponding to the target speech.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.