Patent · US Active

Adaptive diarization model and user interface

US11710496B2 · kind B2 · utility

0Cited by

5References

20Claims

0Family size

Assignee

Google LLC · US

Inventors

Aaron Michael Donsbach · Seattle, US
Dirk Ryan Padfield · Niskayuna, US

Key dates

Filing date	Jul 1, 2019
Grant date	Jul 25, 2023
Priority date	—
Expiry date	Jul 1, 2039

Classification

Technology area (CPC G)Physics
CPC primaryG10L15/26
WIPO fieldComputer technology
WIPO sectorElectrical engineering

Abstract

A computing device receives a first audio waveform representing a first utterance and a second utterance. The computing device receives identity data indicating that the first utterance corresponds to a first speaker and the second utterance corresponds to a second speaker. The computing device determines, based on the first utterance, the second utterance, and the identity data, a diarization model configured to distinguish between utterances by the first speaker and utterances by the second speaker. The computing device receives, exclusively of receiving further identity data indicating a source speaker of a third utterance, a second audio waveform representing the third utterance. The computing device determines, by way of the diarization model and independently of the further identity data of the first type, the source speaker of the third utterance. The computing device updates the diarization model based on the third utterance and the determined source speaker.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.