Patent · US Active

Method for speaker diarization

US10026405B2 · kind B2 · utility

3Cited by
3References
10Claims
0Family size

Assignee

Inventors

Key dates

Filing dateMay 3, 2016
Grant dateJul 17, 2018
Priority date
Expiry dateJun 10, 2036

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG10L2019/0005
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

Disclosed is a speaker diarization process for determining which speaker is speaking at what time during the course of a conversation. The entire process can be most easily described in five main parts: Segmentation where speech/non-speech decisions are made; frame feature extraction where useful information is obtained from the frames; segment modeling where the information from the frame feature extraction is combined with segment start and end time information to create segment specific features; speaker decisions when the segments are clustered to create speaker models; and corrections where frame level corrections are applied to the information extracted.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.