Patent · US Active

Voice shortcut detection with speaker verification

US12033641B2 · kind B2 · utility

Cited by	0
References	1
Claims	20
Family size	0

Assignee

Google LLC · US

Inventors

Rajeev Rikhye · Mountain View, US
Quan Wang · Hoboken, US
Yanzhang He · Mountain View, US
Qiao Liang · Mountain View, US
Ian C. McGraw · Menlo Park, US

Key dates

Filing date	Jan 30, 2023
Grant date	Jul 9, 2024
Priority date	—
Expiry date	Jan 30, 2043

Classification

Technology area (CPC G)	Physics
CPC primary	G10L2015/088
WIPO field	Computer technology
WIPO sector	Electrical engineering

Abstract

Techniques disclosed herein are directed towards streaming keyphrase detection which can be customized to detect one or more particular keyphrases, without requiring retraining of any model(s) for those particular keyphrase(s). Many implementations include processing audio data using a speaker separation model to generate separated audio data which isolates an utterance spoken by a human speaker from one or more additional sounds not spoken by the human speaker, and processing the separated audio data using a text independent speaker identification model to determine whether a verified and/or registered user spoke a spoken utterance captured in the audio data. Various implementations include processing the audio data and/or the separated audio data using an automatic speech recognition model to generate a text representation of the utterance. Additionally or alternatively, the text representation of the utterance can be processed to determine whether at least a portion of the text representation of the utterance captures a particular keyphrase. When the system determines the registered and/or verified user spoke the utterance and the system determines the text representation of the…
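
The processing order described in the abstract can be sketched as follows. This is a minimal illustration, not the patented implementation: the three models are passed in as plain callables (`separate`, `verify_speaker`, `transcribe`), and those names, the `Detection` type, and the word-level matching rule are all assumptions made for this example. Because the abstract is truncated, the sketch only returns the two determinations (speaker verified, keyphrase matched) and leaves what happens next to the caller.

```python
import re
from dataclasses import dataclass
from typing import Callable, List, Optional, Sequence

Audio = Sequence[float]  # hypothetical stand-in for a buffer of audio samples


@dataclass
class Detection:
    speaker_verified: bool    # did the speaker model accept the speaker?
    keyphrase: Optional[str]  # first configured keyphrase found, if any
    transcript: str           # raw text produced by the ASR callable


def tokenize(text: str) -> List[str]:
    """Lowercase the text and split it into word tokens, dropping punctuation."""
    return re.findall(r"[a-z0-9']+", text.lower())


def contains_phrase(words: List[str], phrase: List[str]) -> bool:
    """True if `phrase` occurs as a contiguous run of whole words in `words`."""
    n = len(phrase)
    return n > 0 and any(
        words[i:i + n] == phrase for i in range(len(words) - n + 1)
    )


def detect_keyphrase(
    audio: Audio,
    separate: Callable[[Audio], Audio],        # assumed speaker separation model
    verify_speaker: Callable[[Audio], bool],   # assumed text-independent speaker ID model
    transcribe: Callable[[Audio], str],        # assumed ASR model
    keyphrases: Sequence[str],
) -> Detection:
    # 1. Isolate the human utterance from other sounds.
    separated = separate(audio)
    # 2. Check whether a registered/verified user spoke it.
    verified = verify_speaker(separated)
    # 3. Generate a text representation of the utterance.
    transcript = transcribe(separated)
    # 4. Check whether any portion of the text captures a configured keyphrase.
    words = tokenize(transcript)
    matched = next(
        (k for k in keyphrases if contains_phrase(words, tokenize(k))), None
    )
    return Detection(verified, matched, transcript)
```

In this sketch the keyphrases are ordinary strings supplied at call time, which mirrors the abstract's point that new keyphrases need no model retraining. Matching on whole words rather than raw substrings is a design choice of the example, so that a keyphrase such as "stop" is not triggered by "nonstop".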

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.