Patent · US Active

Speech enhancement for target speakers

US9741360B1 · kind B1 · utility

17Cited by
4References
17Claims
0Family size

Assignee

Inventors

Key dates

Filing dateOct 9, 2016
Grant dateAug 22, 2017
Priority date
Expiry dateOct 9, 2036

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG10L21/0308
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

A method of speech enhancement for target speakers is presented. A blind source separation (BSS) module is used to separate a plurality of microphone recorded audio mixtures into statistically independent audio components. At least one of a plurality of speaker profiles are used to score and weight each audio components, and a speech mixer is used to first mix the weighted audio components, then align the mixed signals, and finally add the aligned signals to generate an extracted speech signal. Similarly, a noise mixer is used to first weight the audio components, then mix the weighted signals, and finally add the mixed signals to generate an extracted noise signal. Post processing is used to further enhance the extracted speech signal with a Wiener filtering or spectral subtraction procedure by subtracting the shaped power spectrum of extracted noise signal from that of the extracted speech signal.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.