Patent · US Active

Intended query detection using E2E modeling for continued conversation

US12315497B2 · kind B2 · utility

0Cited by
16References
20Claims
0Family size

Assignee

Inventors

Key dates

Filing dateMar 20, 2023
Grant dateMay 27, 2025
Priority date
Expiry dateDec 1, 2043

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG10L2015/223
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

A method includes receiving, as input to a speech recognition model, audio data corresponding to a spoken utterance. The method also includes performing, using the speech recognition model, speech recognition on the audio data by, at each of a plurality of time steps, encoding, using an audio encoder, the audio data corresponding to the spoken utterance into a corresponding audio encoding, and decoding, using a speech recognition joint network, the corresponding audio encoding into a probability distribution over possible output labels. At each of the plurality of time steps, the method also includes determining, using an intended query (IQ) joint network configured to receive a label history representation associated with a sequence of non-blank symbols output by a final softmax layer, an intended query decision indicating whether or not the spoken utterance includes a query intended for a digital assistant.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.