Patent · US Active

Speech signal separation and synthesis based on auditory scene analysis and speech modeling

US9536540B2 · kind B2 · utility

9Cited by
289References
20Claims
0Family size

Assignee

Inventors

Key dates

Filing dateJul 18, 2014
Grant dateJan 3, 2017
Priority date
Expiry dateAug 13, 2034

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG10L21/0208
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

Provided are systems and methods for generating clean speech from a speech signal representing a mixture of a noise and speech. The clean speech may be generated from synthetic speech parameters. The synthetic speech parameters are derived based on the speech signal components and a model of speech using auditory and speech production principles. The modeling may utilize a source-filter structure of the speech signal. One or more spectral analyzes on the speech signal are performed to generate spectral representations. The feature data is derived based on a spectral representation. The features corresponding to the target speech according to a model of speech are grouped and separated from the feature data. The synthetic speech parameters, including spectral envelope, pitch data and voice classification data are generated based on features corresponding to the target speech.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.