Patent · US Active

System and method of diarization and labeling of audio data

US10950242B2 · kind B2 · utility

3Cited by
51References
20Claims
0Family size

Assignee

Inventors

Key dates

Filing dateDec 4, 2019
Grant dateMar 16, 2021
Priority date
Expiry dateDec 4, 2039

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG10L17/02
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

Systems and methods of diarization using linguistic labeling include receiving a set of diarized textual transcripts. A least one heuristic is automatedly applied to the diarized textual transcripts to select transcripts likely to be associated with an identified group of speakers. The selected transcripts are analyzed to create at least one linguistic model. The linguistic model is applied to transcripted audio data to label a portion of the transcripted audio data as having been spoken by the identified group of speakers. Still further embodiments of diarization using linguistic labeling may serve to label agent speech and customer speech in a recorded and transcripted customer service interaction.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.