Patent · US Active

Speech enhancement for target speakers

US9741360B1 · kind B1 · utility

17Cited by

4References

17Claims

0Family size

Assignee

Spectimbre Inc. · US

Inventors

Xi Li · Allen, US
Yan Lu · Beijing, CN

Key dates

Filing date	Oct 9, 2016
Grant date	Aug 22, 2017
Priority date	—
Expiry date	Oct 9, 2036

Classification

Technology area (CPC G)Physics
CPC primaryG10L21/0308
WIPO fieldComputer technology
WIPO sectorElectrical engineering

Abstract

A method of speech enhancement for target speakers is presented. A blind source separation (BSS) module is used to separate a plurality of microphone recorded audio mixtures into statistically independent audio components. At least one of a plurality of speaker profiles are used to score and weight each audio components, and a speech mixer is used to first mix the weighted audio components, then align the mixed signals, and finally add the aligned signals to generate an extracted speech signal. Similarly, a noise mixer is used to first weight the audio components, then mix the weighted signals, and finally add the mixed signals to generate an extracted noise signal. Post processing is used to further enhance the extracted speech signal with a Wiener filtering or spectral subtraction procedure by subtracting the shaped power spectrum of extracted noise signal from that of the extracted speech signal.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.