Patent · US Active

Efficient streaming non-recurrent on-device end-to-end model

US11715458B2 · kind B2 · utility

Cited by	1

References	0

Claims	20

Family size	0

Assignee

Google LLC · US

Inventors

Tara N. Sainath · Jersey City, US
Arun Narayanan · Rochester Hills, US
Rami Botros · Mountain View, US
Yanzhang He · Mountain View, US
Ehsan Variani · Mountain View, US
Cyril Georges Luc Allauzen · New York, US
David Rybach · Aachen, DE
Ruoming Pang · New York, US
Trevor Strohman · Sunnyvale, US

Key dates

Filing date	May 10, 2021
Grant date	Aug 1, 2023
Priority date	—
Expiry date	Oct 8, 2041

Classification

Technology area (CPC G)	Physics
CPC primary	G10L15/32
WIPO field	Computer technology
WIPO sector	Electrical engineering

Abstract

An ASR model includes a first encoder configured to receive a sequence of acoustic frames and generate a first higher order feature representation for a corresponding acoustic frame in the sequence of acoustic frames. The ASR model also includes a second encoder configured to receive the first higher order feature representation generated by the first encoder at each of a plurality of output steps and generate a second higher order feature representation for a corresponding first higher order feature frame. The ASR model also includes a decoder configured to receive the second higher order feature representation generated by the second encoder at each of the plurality of output steps and generate a first probability distribution over possible speech recognition hypotheses. The ASR model also includes a language model configured to receive the first probability distribution over possible speech recognition hypotheses and generate a rescored probability distribution.
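
The data flow described in the abstract (first encoder, second encoder, decoder, language-model rescoring) can be sketched roughly as follows. This is an illustrative toy, not the patented implementation: every function body, the fixed hypothesis list, the `lm_weight` parameter, and the log-linear rescoring rule are assumptions made for the example, since the abstract does not specify layer types or how the language model combines scores.

```python
import math
from typing import Dict, List, Sequence

Vector = List[float]


def softmax(scores: Sequence[float]) -> List[float]:
    """Numerically stable softmax over a list of scores."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def first_encoder(frame: Vector) -> Vector:
    """Hypothetical stand-in: acoustic frame -> first higher order features."""
    return [math.tanh(x) for x in frame]


def second_encoder(features: Vector) -> Vector:
    """Hypothetical stand-in: first -> second higher order features."""
    return [math.tanh(2.0 * x) for x in features]


def decoder(features: Vector, hypotheses: Sequence[str]) -> Dict[str, float]:
    """Hypothetical stand-in: features -> first probability distribution."""
    pooled = sum(features)
    scores = [pooled * (i + 1) / len(hypotheses) for i in range(len(hypotheses))]
    return dict(zip(hypotheses, softmax(scores)))


def rescore(first_pass: Dict[str, float],
            lm_log_prob: Dict[str, float],
            lm_weight: float = 0.5) -> Dict[str, float]:
    """Assumed rescoring rule: log-linear interpolation, then renormalize."""
    combined = [math.log(p) + lm_weight * lm_log_prob[h]
                for h, p in first_pass.items()]
    return dict(zip(first_pass.keys(), softmax(combined)))


def recognize(frames: Sequence[Vector],
              hypotheses: Sequence[str],
              lm_log_prob: Dict[str, float]) -> List[Dict[str, float]]:
    """Run the cascade once per output step (here: one step per frame)."""
    outputs = []
    for frame in frames:
        first = first_encoder(frame)         # first higher order representation
        second = second_encoder(first)       # second higher order representation
        first_dist = decoder(second, hypotheses)
        outputs.append(rescore(first_dist, lm_log_prob))
    return outputs
```

Calling `recognize` with a short list of frames, a list of candidate hypotheses, and per-hypothesis language-model log-probabilities returns one rescored distribution per frame. The point of the sketch is only the ordering of the four stages; a real system would replace each toy function with a trained network.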

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.