Patent · US Active

Blind diarization of recorded calls with arbitrary number of speakers

US9460722B2 · kind B2 · utility

Cited by: 45
References: 45
Claims: 13
Family size: 0

Assignee

Inventors

Key dates

Filing date: Jun 30, 2014
Grant date: Oct 4, 2016
Priority date
Expiry date: Nov 30, 2034

Classification

  • Technology area (CPC H): Electricity
  • CPC primary: H04M2203/303
  • WIPO field: Computer technology
  • WIPO sector: Electrical engineering

Abstract

In a method of diarization of audio data, audio data is segmented into a plurality of utterances. Each utterance is represented as an utterance model representative of a plurality of feature vectors. The utterance models are clustered. A plurality of speaker models are constructed from the clustered utterance models. A hidden Markov model is constructed of the plurality of speaker models. A sequence of identified speaker models is decoded.
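
The steps named in the abstract can be illustrated with a minimal sketch. It is not the patented implementation, and every modelling choice in it is an assumption made for illustration: each utterance model is just the mean of its feature vectors, clustering is a greedy agglomerative merge that stops at a distance threshold (so the number of speakers is not fixed in advance), each speaker model is a single diagonal Gaussian, and the hidden Markov model has one state per speaker with a hypothetical `stay_prob` self-transition probability. All function and parameter names are invented here. Segmentation into utterances is assumed to have been done already.

```python
import math

def utterance_model(frames):
    """Assumed utterance model: the mean of the utterance's feature vectors."""
    dim = len(frames[0])
    return [sum(f[d] for f in frames) / len(frames) for d in range(dim)]

def sq_dist(a, b):
    return sum((x - y) ** 2 for x, y in zip(a, b))

def cluster(models, threshold):
    """Greedy agglomerative clustering of utterance models.

    Merges the two closest cluster centroids until the closest pair is
    farther apart than `threshold`; the cluster count is an outcome.
    """
    dim = len(models[0])
    clusters = [[i] for i in range(len(models))]

    def centroid(members):
        return [sum(models[i][d] for i in members) / len(members) for d in range(dim)]

    while len(clusters) > 1:
        best = None
        for a in range(len(clusters)):
            for b in range(a + 1, len(clusters)):
                d = sq_dist(centroid(clusters[a]), centroid(clusters[b]))
                if best is None or d < best[0]:
                    best = (d, a, b)
        if best[0] > threshold:
            break
        _, a, b = best
        clusters[a] += clusters[b]
        del clusters[b]
    return clusters

def speaker_model(utterances, members, var_floor=1e-3):
    """Assumed speaker model: one diagonal Gaussian over all member frames."""
    frames = [f for i in members for f in utterances[i]]
    dim = len(frames[0])
    mean = [sum(f[d] for f in frames) / len(frames) for d in range(dim)]
    var = [max(sum((f[d] - mean[d]) ** 2 for f in frames) / len(frames), var_floor)
           for d in range(dim)]
    return mean, var

def log_likelihood(frames, model):
    """Log-likelihood of an utterance's frames under a diagonal Gaussian."""
    mean, var = model
    return sum(-0.5 * (math.log(2 * math.pi * v) + (x - m) ** 2 / v)
               for f in frames for x, m, v in zip(f, mean, var))

def viterbi(utterances, speakers, stay_prob=0.8):
    """Decode the most likely speaker sequence, one HMM state per speaker."""
    n = len(speakers)
    log_stay = math.log(stay_prob)
    log_switch = math.log((1 - stay_prob) / max(n - 1, 1))
    score = [-math.log(n) + log_likelihood(utterances[0], s) for s in speakers]
    back = []
    for utt in utterances[1:]:
        new_score, pointers = [], []
        for j, s in enumerate(speakers):
            cands = [score[i] + (log_stay if i == j else log_switch) for i in range(n)]
            best_i = max(range(n), key=cands.__getitem__)
            new_score.append(cands[best_i] + log_likelihood(utt, s))
            pointers.append(best_i)
        score = new_score
        back.append(pointers)
    state = max(range(n), key=score.__getitem__)
    path = [state]
    for pointers in reversed(back):
        state = pointers[state]
        path.append(state)
    return path[::-1]

def diarize(utterances, threshold=1.0):
    """Utterance models -> clusters -> speaker models -> HMM decoding."""
    models = [utterance_model(u) for u in utterances]
    clusters = cluster(models, threshold)
    speakers = [speaker_model(utterances, c) for c in clusters]
    return viterbi(utterances, speakers)
```

Here `utterances` is a list of utterances, each a list of equal-length feature vectors, and `diarize` returns one speaker index per utterance. The threshold-based stopping rule is one simple way to leave the speaker count open; a real system would likely use richer utterance and speaker models than the single Gaussians assumed here.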

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.