Patent · US Active

Target speaker separation system, device and storage medium

US11978470B2 · kind B2 · utility

0Cited by

1References

10Claims

0Family size

Assignee

Institute of Automation, Chinese Academy of Sciences · CN

Inventors

Jiaming Xu · Beijing, CN
Jian Cui · Waltham, US
Bo Xu · Beijing, CN

Key dates

Filing date	Nov 3, 2022
Grant date	May 7, 2024
Priority date	—
Expiry date	Nov 4, 2042

Classification

Technology area (CPC H)Electricity
CPC primaryH04S1/007
WIPO fieldComputer technology
WIPO sectorElectrical engineering

Abstract

Disclosed are a target speaker separation system, an electronic device and a storage medium. The system includes: first, performing, jointly unified modeling on a plurality of cues based a masked pre-training strategy, to boost the inference capability of a model for missing cues and enhance the representation accuracy of disturbed cues; and second, constructing a hierarchical cue modulation module. A spatial cue is introduced into a primary cue modulation module for directional enhancement of a speech of a speaker; in an intermediate cue modulation module, the speech of the speaker is enhanced on the basis of temporal coherence of a dynamic cue and an auditory signal component; a steady-state cue is introduced into an advanced cue modulation module for selective filtering; and finally, the supervised learning capability of simulation data and the unsupervised learning effect of real mixed data are sufficiently utilized.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.