Patent · US Active

System and method of diarization and labeling of audio data

US10720164B2 · kind B2 · utility

0Cited by
50References
20Claims
0Family size

Assignee

Inventors

Key dates

Filing dateDec 4, 2019
Grant dateJul 21, 2020
Priority date
Expiry dateDec 4, 2039

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG10L17/02
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

Systems and methods of diarization using linguistic labeling include receiving a set of diarized textual transcripts. A least one heuristic is automatedly applied to the diarized textual transcripts to select transcripts likely to be associated with an identified group of speakers. The selected transcripts are analyzed to create at least one linguistic model. The linguistic model is applied to transcripted audio data to label a portion of the transcripted audio data as having been spoken by the identified group of speakers. Still further embodiments of diarization using linguistic labeling may serve to label agent speech and customer speech in a recorded and transcripted customer service interaction.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.