Patent · US Active

End-to-end speaker recognition using deep neural network

US10381009B2 · kind B2 · utility

3Cited by

0References

14Claims

0Family size

Assignee

Pindrop Security, Inc. · US

Inventors

Elie Khoury · Atlanta, US
Matthew Garland · Atlanta, US

Key dates

Filing date	Nov 20, 2017
Grant date	Aug 13, 2019
Priority date	—
Expiry date	Nov 20, 2037

Classification

Technology area (CPC G)Physics
CPC primaryG10L17/22
WIPO fieldComputer technology
WIPO sectorElectrical engineering

Abstract

The present invention is directed to a deep neural network (DNN) having a triplet network architecture, which is suitable to perform speaker recognition. In particular, the DNN includes three feed-forward neural networks, which are trained according to a batch process utilizing a cohort set of negative training samples. After each batch of training samples is processed, the DNN may be trained according to a loss function, e.g., utilizing a cosine measure of similarity between respective samples, along with positive and negative margins, to provide a robust representation of voiceprints.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.