Patent · US Active

Using context information with end-to-end models for speech recognition

US11545142B2 · kind B2 · utility

Cited by: 0
References: 4
Claims: 28
Family size: 0

Assignee

Inventors

Key dates

Filing date: Mar 24, 2020
Grant date: Jan 3, 2023
Priority date
Expiry date: Jun 22, 2040

Classification

  • Technology area (CPC G): Physics
  • CPC primary: G10L2015/228
  • WIPO field: Computer technology
  • WIPO sector: Electrical engineering

Abstract

A method includes receiving audio data encoding an utterance, processing, using a speech recognition model, the audio data to generate speech recognition scores for speech elements, and determining context scores for the speech elements based on context data indicating a context for the utterance. The method also includes executing, using the speech recognition scores and the context scores, a beam search decoding process to determine one or more candidate transcriptions for the utterance. The method also includes selecting a transcription for the utterance from the one or more candidate transcriptions.
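
The steps in the abstract can be illustrated with a minimal sketch. It is not the patented implementation: the word-level speech elements, the additive combination of scores, the `weight` and `bonus` parameters, and all function names are assumptions made for illustration. For simplicity the per-step recognition scores do not depend on earlier outputs, which a real end-to-end model would not guarantee.

```python
import math
from typing import Dict, List, Set, Tuple

Beam = Tuple[Tuple[str, ...], float]  # (speech elements so far, cumulative score)


def context_scores(elements, context_words: Set[str], bonus: float = 1.0) -> Dict[str, float]:
    """Hypothetical context scoring: a fixed bonus for elements found in the context data."""
    return {e: (bonus if e in context_words else 0.0) for e in elements}


def beam_search(
    step_scores: List[Dict[str, float]],
    context_words: Set[str],
    beam_width: int = 2,
    weight: float = 1.0,
) -> List[Beam]:
    """Beam search over per-step log-probabilities, biased by context scores."""
    beams: List[Beam] = [((), 0.0)]
    for scores in step_scores:
        ctx = context_scores(scores, context_words)
        # Extend every surviving hypothesis with every speech element.
        expanded = [
            (prefix + (element,), total + asr + weight * ctx[element])
            for prefix, total in beams
            for element, asr in scores.items()
        ]
        # Keep only the best `beam_width` hypotheses.
        expanded.sort(key=lambda cand: cand[1], reverse=True)
        beams = expanded[:beam_width]
    return beams


def transcribe(step_scores: List[Dict[str, float]], context_words: Set[str]) -> str:
    """Select the top-scoring candidate transcription."""
    candidates = beam_search(step_scores, context_words)
    return " ".join(candidates[0][0])


# Toy recognition scores (log-probabilities) for a two-word utterance.
STEPS = [
    {"call": math.log(0.9), "tall": math.log(0.1)},
    {"jon": math.log(0.6), "john": math.log(0.4)},
]

if __name__ == "__main__":
    print(transcribe(STEPS, set()))     # no context data
    print(transcribe(STEPS, {"john"}))  # context: "john" is in the contact list
```

In this toy setup, with no context data the acoustically likelier "jon" wins. Once "john" appears in the context data, its bonus outweighs the gap in recognition score and the selected transcription changes.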

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.