Blind diarization of recorded calls with arbitrary number of speakers
US9460722B2 · kind B2 · utility
Cited by: 45
References: 45
Claims: 13
Family size: 0
Assignee
Inventors
Key dates
| Filing date | Jun 30, 2014 |
| Grant date | Oct 4, 2016 |
| Priority date | — |
| Expiry date | Nov 30, 2034 |
Classification
- Technology area (CPC H): Electricity
- CPC primary: H04M2203/303
- WIPO field: Computer technology
- WIPO sector: Electrical engineering
Abstract
In a method of diarization of audio data, audio data is segmented into a plurality of utterances. Each utterance is represented as an utterance model representative of a plurality of feature vectors. The utterance models are clustered. A plurality of speaker models are constructed from the clustered utterance models. A hidden Markov model is constructed of the plurality of speaker models. A sequence of identified speaker models is decoded.
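The steps named in the abstract can be illustrated with a minimal sketch. It is not the patented implementation, and every modeling choice in it is an assumption made for illustration: each utterance model is simply the mean of its feature vectors, clustering is a greedy merge under a distance threshold (so the number of speakers is not fixed in advance), each speaker model is a unit-variance Gaussian centered on its cluster mean, and the hidden Markov model uses a single hypothetical "stay" probability of 0.8. All function names are hypothetical.

```python
import math

def utterance_model(frames):
    """Summarize a list of feature vectors by their mean (assumed model)."""
    dim = len(frames[0])
    return [sum(f[d] for f in frames) / len(frames) for d in range(dim)]

def sq_dist(a, b):
    """Squared Euclidean distance between two vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))

def cluster(models, threshold):
    """Greedily merge the closest clusters until none are within threshold."""
    clusters = [[i] for i in range(len(models))]

    def centroid(c):
        return utterance_model([models[i] for i in c])

    while len(clusters) > 1:
        best = None
        for i in range(len(clusters)):
            for j in range(i + 1, len(clusters)):
                d = sq_dist(centroid(clusters[i]), centroid(clusters[j]))
                if best is None or d < best[0]:
                    best = (d, i, j)
        if best[0] > threshold:
            break
        _, i, j = best
        clusters[i] = clusters[i] + clusters[j]
        del clusters[j]
    return clusters

def viterbi(models, speakers, stay_prob=0.8):
    """Decode the most likely speaker index for each utterance model."""
    n = len(speakers)
    switch = (1 - stay_prob) / max(n - 1, 1)

    def emit(m, s):
        # Unit-variance Gaussian log-likelihood, up to an additive constant.
        return -0.5 * sq_dist(m, s)

    def trans(p, s):
        return math.log(stay_prob if p == s else switch)

    score = [emit(models[0], s) - math.log(n) for s in speakers]
    back = []
    for m in models[1:]:
        prev = score
        ptr = [max(range(n), key=lambda p: prev[p] + trans(p, s))
               for s in range(n)]
        score = [prev[ptr[s]] + trans(ptr[s], s) + emit(m, speakers[s])
                 for s in range(n)]
        back.append(ptr)
    state = max(range(n), key=lambda s: score[s])
    path = [state]
    for ptr in reversed(back):
        state = ptr[state]
        path.append(state)
    return path[::-1]

def diarize(utterances, threshold=1.0):
    """Segmented utterances in, one speaker index per utterance out."""
    models = [utterance_model(u) for u in utterances]
    clusters = cluster(models, threshold)
    speakers = [utterance_model([models[i] for i in c]) for c in clusters]
    return viterbi(models, speakers)
```

The sketch takes already-segmented utterances as input, so the segmentation step itself is outside its scope. Calling `diarize` on a list of utterances, each a list of equal-length feature vectors, returns one speaker index per utterance; the `threshold` value controls how many speakers emerge and would need tuning for real features.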
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.