Metric learning of speaker diarization
US11651767B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Mar 3, 2020 |
| Grant date | May 16, 2023 |
| Priority date | — |
| Expiry date | Nov 27, 2040 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG06N3/084
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
A computer-implemented method includes obtaining training data including utterances of speakers in acoustic conditions, preparing at least one machine learning model, each machine learning model including a common embedding model for converting an utterance into a feature vector and a classification model for classifying the feature vector, and training, by using the training data, the machine learning model to perform classification by speaker and to perform classification by acoustic condition.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.