Patent · US Active

Method for speaker diarization

US10026405B2 · kind B2 · utility

3Cited by

3References

10Claims

0Family size

Assignee

SESTEK SES VE ILETISIM BILGISAYAR TEK. SAN VE TIC A.S. · TR

Inventors

Mustafa Levent Arslan · İstanbul, TR
Mustafa ERDEN · Paris, FR
Sedat Demirba{hacek over (g)} · İstanbul, TR
Gökçe Sarar · İstanbul, TR

Key dates

Filing date	May 3, 2016
Grant date	Jul 17, 2018
Priority date	—
Expiry date	Jun 10, 2036

Classification

Technology area (CPC G)Physics
CPC primaryG10L2019/0005
WIPO fieldComputer technology
WIPO sectorElectrical engineering

Abstract

Disclosed is a speaker diarization process for determining which speaker is speaking at what time during the course of a conversation. The entire process can be most easily described in five main parts: Segmentation where speech/non-speech decisions are made; frame feature extraction where useful information is obtained from the frames; segment modeling where the information from the frame feature extraction is combined with segment start and end time information to create segment specific features; speaker decisions when the segments are clustered to create speaker models; and corrections where frame level corrections are applied to the information extracted.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.