Method for speaker diarization
US10026405B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | May 3, 2016 |
| Grant date | Jul 17, 2018 |
| Priority date | — |
| Expiry date | Jun 10, 2036 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG10L2019/0005
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
Disclosed is a speaker diarization process for determining which speaker is speaking at what time during the course of a conversation. The entire process can be most easily described in five main parts: Segmentation where speech/non-speech decisions are made; frame feature extraction where useful information is obtained from the frames; segment modeling where the information from the frame feature extraction is combined with segment start and end time information to create segment specific features; speaker decisions when the segments are clustered to create speaker models; and corrections where frame level corrections are applied to the information extracted.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.