Real-time target speaker audio enhancement
US12272371B1 · kind B1 · utility
Assignee
Inventors
Key dates
| Filing date | Jun 30, 2021 |
| Grant date | Apr 8, 2025 |
| Priority date | — |
| Expiry date | Mar 26, 2043 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG10L17/04
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
Real-time audio enhancement for a target speaker may be performed. An embedding of a sample of speaker audio is created using a trained neural network that performs voice identification. The embedding is then concatenated with the input features of a trained machine learning model for audio enhancement. The audio enhancement model can recognize and enhance a target speaker's speech in a real-time implementation, as the embedding is in the same feature space of the audio enhancement model.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.