Speech extraction method, system, and device based on supervised learning auditory attention
US10923136B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Apr 19, 2019 |
| Grant date | Feb 16, 2021 |
| Priority date | — |
| Expiry date | Apr 19, 2039 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG10L2021/02087
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
A speech extraction method based on the supervised learning auditory attention includes: converting an original overlapping speech signal into a two-dimensional time-frequency signal representation by a short-time Fourier transform to obtain a first overlapping speech signal; performing a first sparsification on the first overlapping speech signal, mapping intensity information of a time-frequency unit of the first overlapping speech signal to preset D intensity levels, and performing a second sparsification on the first overlapping speech signal based on information of the preset D intensity levels to obtain a second overlapping speech signal; converting the second overlapping speech signal into a pulse signal by a time coding method; extracting a target pulse from the pulse signal by a trained target pulse extraction network; converting the target pulse into a time-frequency representation of the target speech to obtain the target speech by an inverse short-time Fourier transform.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.