Patent · US Active

Speech extraction method, system, and device based on supervised learning auditory attention

US10923136B2 · kind B2 · utility

0Cited by

2References

10Claims

0Family size

Assignee

Institute of Automation, Chinese Academy of Sciences · CN

Inventors

Jiaming Xu · Beijing, CN
Yating Huang · Beijing, CN
Bo Xu · Beijing, CN

Key dates

Filing date	Apr 19, 2019
Grant date	Feb 16, 2021
Priority date	—
Expiry date	Apr 19, 2039

Classification

Technology area (CPC G)Physics
CPC primaryG10L2021/02087
WIPO fieldComputer technology
WIPO sectorElectrical engineering

Abstract

A speech extraction method based on the supervised learning auditory attention includes: converting an original overlapping speech signal into a two-dimensional time-frequency signal representation by a short-time Fourier transform to obtain a first overlapping speech signal; performing a first sparsification on the first overlapping speech signal, mapping intensity information of a time-frequency unit of the first overlapping speech signal to preset D intensity levels, and performing a second sparsification on the first overlapping speech signal based on information of the preset D intensity levels to obtain a second overlapping speech signal; converting the second overlapping speech signal into a pulse signal by a time coding method; extracting a target pulse from the pulse signal by a trained target pulse extraction network; converting the target pulse into a time-frequency representation of the target speech to obtain the target speech by an inverse short-time Fourier transform.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.