Patent · US Active

Speaker identification accuracy

US11468900B2 · kind B2 · utility

Cited by	0

References	6

Claims	18

Family size	0

Assignee

Google LLC · US

Inventors

Yeming Fang · Santa Clara, US
Quan Wang · Hoboken, US
Pedro J. Moreno Mengibar · Jersey City, US
Ignacio Lopez Moreno · Brooklyn, US
Gang Feng · Bellevue, US
Fang Chu · Santa Clara, US
Jin Shi · Hangzhou City, CN
Jason Pelecanos · Mountain View, US

Key dates

Filing date	Oct 15, 2020
Grant date	Oct 11, 2022
Priority date	—
Expiry date	Oct 15, 2040

Classification

Technology area (CPC G)	Physics
CPC primary	G10L17/08
WIPO field	Computer technology
WIPO sector	Electrical engineering

Abstract

A method of generating an accurate speaker representation for an audio sample includes receiving a first audio sample from a first speaker and a second audio sample from a second speaker. The method includes dividing a respective audio sample into a plurality of audio slices. The method also includes, based on the plurality of slices, generating a set of candidate acoustic embeddings where each candidate acoustic embedding includes a vector representation of acoustic features. The method further includes removing a subset of the candidate acoustic embeddings from the set of candidate acoustic embeddings. The method additionally includes generating an aggregate acoustic embedding from the remaining candidate acoustic embeddings in the set of candidate acoustic embeddings after removing the subset of the candidate acoustic embeddings.
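The steps in the abstract (slice, embed, remove a subset, aggregate) can be illustrated with a minimal sketch. This is not the patented implementation: the abstract does not say how embeddings are computed, how the removed subset is chosen, or how the aggregate is formed. The sketch assumes a toy three-value feature vector in place of a real speaker encoder, assumes removal of the candidates farthest from the centroid, and assumes averaging as the aggregation. All function names and parameters (`slice_audio`, `toy_embedding`, `prune`, `aggregate_embedding`, `slice_len`, `n_remove`) are hypothetical.

```python
import math
from typing import List, Sequence

Vector = List[float]


def slice_audio(samples: Sequence[float], slice_len: int) -> List[Sequence[float]]:
    """Divide an audio sample into fixed-length, non-overlapping slices.

    A trailing remainder shorter than slice_len is dropped.
    """
    return [samples[i:i + slice_len]
            for i in range(0, len(samples) - slice_len + 1, slice_len)]


def toy_embedding(audio_slice: Sequence[float]) -> Vector:
    """Hypothetical stand-in for a speaker encoder.

    Returns [mean, RMS energy, zero-crossing rate]; a real system would
    use a learned model here.
    """
    n = len(audio_slice)
    mean = sum(audio_slice) / n
    rms = math.sqrt(sum(x * x for x in audio_slice) / n)
    crossings = sum(1 for a, b in zip(audio_slice, audio_slice[1:]) if a * b < 0)
    return [mean, rms, crossings / n]


def centroid(vectors: List[Vector]) -> Vector:
    """Element-wise mean of a non-empty list of equal-length vectors."""
    return [sum(column) / len(vectors) for column in zip(*vectors)]


def euclidean(u: Vector, v: Vector) -> float:
    """Euclidean distance between two equal-length vectors."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))


def prune(candidates: List[Vector], n_remove: int) -> List[Vector]:
    """Remove the n_remove candidates farthest from the centroid.

    The removal criterion is an assumption made for this sketch.
    """
    if not 0 <= n_remove < len(candidates):
        raise ValueError("n_remove must leave at least one candidate")
    center = centroid(candidates)
    ranked = sorted(candidates, key=lambda e: euclidean(e, center))
    return ranked[:len(ranked) - n_remove]


def aggregate_embedding(samples: Sequence[float], slice_len: int,
                        n_remove: int) -> Vector:
    """Slice, embed, prune, then average the remaining candidates."""
    candidates = [toy_embedding(s) for s in slice_audio(samples, slice_len)]
    remaining = prune(candidates, n_remove)
    return centroid(remaining)
```

Calling `aggregate_embedding(samples, slice_len=4, n_remove=1)` on a recording in which one slice is much louder than the rest would drop that slice's embedding before averaging, so the aggregate reflects only the consistent slices. Distance to the centroid is only one possible criterion; the claims of the patent, not this sketch, define what is actually covered.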

Source: USPTO / EPO open patent data. Bibliographic details and citation counts are objective figures taken from that data.