Patent · US Active

Real-time target speaker audio enhancement

US12272371B1 · kind B1 · utility

0Cited by
4References
20Claims
0Family size

Assignee

Inventors

Key dates

Filing dateJun 30, 2021
Grant dateApr 8, 2025
Priority date
Expiry dateMar 26, 2043

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG10L17/04
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

Real-time audio enhancement for a target speaker may be performed. An embedding of a sample of speaker audio is created using a trained neural network that performs voice identification. The embedding is then concatenated with the input features of a trained machine learning model for audio enhancement. The audio enhancement model can recognize and enhance a target speaker's speech in a real-time implementation, as the embedding is in the same feature space of the audio enhancement model.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.