Patent · US Active

Blind diarization of recorded calls with arbitrary number of speakers

US9881617B2 · kind B2 · utility

Cited by: 28
References: 51
Claims: 16
Family size: 0

Assignee

Inventors

Key dates

Filing date: Sep 1, 2016
Grant date: Jan 30, 2018
Priority date:
Expiry date: Sep 1, 2036

Classification

  • Technology area (CPC H): Electricity
  • CPC primary: H04M2203/303
  • WIPO field: Computer technology
  • WIPO sector: Electrical engineering

Abstract

In a method of diarization of audio data, audio data is segmented into a plurality of utterances. Each utterance is represented as an utterance model representative of a plurality of feature vectors. The utterance models are clustered. A plurality of speaker models are constructed from the clustered utterance models. A hidden Markov model is constructed of the plurality of speaker models. A sequence of identified speaker models is decoded.
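The steps named in the abstract can be illustrated with a minimal sketch. It is not the patented method. It assumes that each utterance model is simply the mean of its feature vectors, that clustering is a greedy distance-threshold pass, that each speaker model is a cluster centroid, and that the hidden Markov model has one state per speaker with a "sticky" self-transition. All function names, the `threshold` parameter, and the `stay_prob` parameter are hypothetical and chosen for illustration only.

```python
import math

def mean_vector(vectors):
    """Average a list of equal-length feature vectors (assumed utterance model)."""
    n = len(vectors)
    return [sum(col) / n for col in zip(*vectors)]

def sq_dist(a, b):
    """Squared Euclidean distance between two vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))

def cluster_utterances(models, threshold):
    """Greedy clustering (an assumption): join the nearest cluster whose
    centroid is within `threshold`, otherwise start a new cluster."""
    clusters = []
    for m in models:
        best, best_d = None, threshold
        for c in clusters:
            d = sq_dist(m, mean_vector(c))
            if d < best_d:
                best, best_d = c, d
        if best is None:
            clusters.append([m])
        else:
            best.append(m)
    return clusters

def viterbi(models, speakers, stay_prob=0.8):
    """Decode the most likely speaker sequence over an HMM with one state
    per speaker model. Emission score is -0.5 * squared distance, i.e. an
    isotropic unit-variance Gaussian up to a constant (an assumption)."""
    k = len(speakers)
    if k == 1:
        return [0] * len(models)
    log_stay = math.log(stay_prob)
    log_switch = math.log((1 - stay_prob) / (k - 1))

    def emit(m, s):
        return -0.5 * sq_dist(m, speakers[s])

    score = [emit(models[0], s) - math.log(k) for s in range(k)]
    back = []
    for m in models[1:]:
        new_score, ptr = [], []
        for s in range(k):
            cands = [score[p] + (log_stay if p == s else log_switch)
                     for p in range(k)]
            p_best = max(range(k), key=cands.__getitem__)
            new_score.append(cands[p_best] + emit(m, s))
            ptr.append(p_best)
        score = new_score
        back.append(ptr)
    # Trace back from the best final state.
    state = max(range(k), key=score.__getitem__)
    path = [state]
    for ptr in reversed(back):
        state = ptr[state]
        path.append(state)
    return path[::-1]

def diarize(utterances, threshold=1.0):
    """utterances: list of utterances, each a list of feature vectors."""
    models = [mean_vector(u) for u in utterances]        # utterance models
    clusters = cluster_utterances(models, threshold)     # clustering
    speakers = [mean_vector(c) for c in clusters]        # speaker models
    return viterbi(models, speakers)                     # HMM decoding
```

Calling `diarize` on a list of utterances returns one speaker index per utterance. The number of speakers is not fixed in advance: in this sketch it falls out of the clustering threshold.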

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.