Word-level blind diarization of recorded calls with arbitrary number of speakers
US11636860B2 · kind B2 · utility
1Cited by
55References
20Claims
0Family size
Assignee
Inventors
Key dates
| Filing date | Jul 21, 2020 |
| Grant date | Apr 25, 2023 |
| Priority date | — |
| Expiry date | Jan 25, 2041 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG10L25/84
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
Disclosed herein are methods of diarizing audio data using first-pass blind diarization and second-pass blind diarization that generate speaker statistical models, wherein the first pass-blind diarization is on a per-frame basis and the second pass-blind diarization is on a per-word basis, and methods of creating acoustic signatures for a common speaker based only on the statistical models of the speakers in each audio session.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.