Patent · US Active

Utilizing machine learning models to provide cognitive speaker fractionalization with empathy recognition

US11715487B2 · kind B2 · utility

0Cited by
1References
20Claims
0Family size

Assignee

Inventors

Key dates

Filing dateMar 31, 2021
Grant dateAug 1, 2023
Priority date
Expiry dateOct 5, 2041

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG10L25/78
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

A device may receive audio data identifying a plurality of speakers and may process the audio data, with a plurality of clustering models, to identify a plurality of speaker segments. The device may determine a plurality of diarization error rates for the plurality of speaker segments and may identify a plurality of errors in the plurality of speaker segments. The device may select rectification models to rectify the plurality of errors and may segment and/or re-segment the audio data with the rectification models to generate re-segmented audio data. The device may determine a plurality of modified diarization error rates for the plurality of speaker segments based on the re-segmented audio data and may select one of the plurality of speaker segments based on the plurality of modified diarization error rates. The device may calculate an empathy score based on the selected speaker segment and may perform actions based on the empathy score.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.