Patent · US Active

Real-time target speaker audio enhancement

US12272371B1 · kind B1 · utility

0Cited by

4References

20Claims

0Family size

Assignee

AMAZON TECHNOLOGIES, INC. · US

Inventors

Ritwik Giri · Sunnyvale, US
Shrikant Venkataramani · Champaign, US
Jean-Marc Valin · Montréal, CA
Mehmet Umut Isik · Menlo Park, US
Arvindh Krishnaswamy · San Jose, US

Key dates

Filing date	Jun 30, 2021
Grant date	Apr 8, 2025
Priority date	—
Expiry date	Mar 26, 2043

Classification

Technology area (CPC G)Physics
CPC primaryG10L17/04
WIPO fieldComputer technology
WIPO sectorElectrical engineering

Abstract

Real-time audio enhancement for a target speaker may be performed. An embedding of a sample of speaker audio is created using a trained neural network that performs voice identification. The embedding is then concatenated with the input features of a trained machine learning model for audio enhancement. The audio enhancement model can recognize and enhance a target speaker's speech in a real-time implementation, as the embedding is in the same feature space of the audio enhancement model.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.