Word-level blind diarization of recorded calls with arbitrary number of speakers
US9875742B2 · kind B2 · utility
Cited by: 22
References: 52
Claims: 16
Family size: 0
Assignee
Inventors
Key dates
| Filing date | Jan 26, 2016 |
| Grant date | Jan 23, 2018 |
| Priority date | — |
| Expiry date | Jan 26, 2036 |
Classification
- Technology area (CPC G): Physics
- CPC primary: G10L25/84
- WIPO field: Computer technology
- WIPO sector: Electrical engineering
Abstract
Disclosed herein are methods of diarizing audio data using first-pass blind diarization and second-pass blind diarization that generate speaker statistical models, wherein the first-pass blind diarization is on a per-frame basis and the second-pass blind diarization is on a per-word basis, and methods of creating acoustic signatures for a common speaker based only on the statistical models of the speakers in each audio session.
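The two-pass idea in the abstract can be illustrated with a toy sketch. This is not the patented method. It assumes, purely for illustration, that each frame is reduced to one scalar feature, that a speaker "model" is just a mean value, that there are exactly two speakers, and that word boundaries are already known. The function names `first_pass` and `second_pass` are hypothetical.

```python
from statistics import fmean


def nearest(value, centers):
    """Return the index of the center closest to value."""
    return min(range(len(centers)), key=lambda i: abs(value - centers[i]))


def first_pass(frames, centers, iterations=10):
    """Per-frame pass: k-means-style labelling of scalar frame features.

    Returns (frame_labels, refined_centers). The centers stand in for the
    speaker statistical models (an assumption of this sketch).
    """
    centers = list(centers)
    labels = []
    for _ in range(iterations):
        labels = [nearest(f, centers) for f in frames]
        for k in range(len(centers)):
            members = [f for f, lab in zip(frames, labels) if lab == k]
            if members:  # keep the old center if no frame was assigned
                centers[k] = fmean(members)
    return labels, centers


def second_pass(frames, word_spans, centers):
    """Per-word pass: one speaker label per word.

    Each word is a (start, end) frame range; the word is assigned to the
    model nearest to the mean of its frames.
    """
    return [nearest(fmean(frames[start:end]), centers)
            for start, end in word_spans]


# Toy data: two words of four frames each, each containing one outlier frame.
frames = [0.1, 0.2, 0.9, 0.15, 1.0, 1.1, 0.2, 0.95]
word_spans = [(0, 4), (4, 8)]  # hypothetical word boundaries

frame_labels, models = first_pass(frames, centers=[0.0, 1.0])
word_labels = second_pass(frames, word_spans, models)

print(frame_labels)  # [0, 0, 1, 0, 1, 1, 0, 1]
print(word_labels)   # [0, 1]
```

In this toy run the per-frame labels flip speaker inside each word because of the outlier frames, while the per-word pass assigns a single speaker to each word.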
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.