Patent · US Active

Transducer-based streaming deliberation for cascaded encoders

US12118988B2 · kind B2 · utility

0 Cited by

0 References

24 Claims

0 Family size

Assignee

Google LLC · US

Inventors

Ke Hu · Stony Brook, US
Tara N. Sainath · Jersey City, US
Arun Narayanan · Rochester Hills, US
Ruoming Pang · New York, US
Trevor Strohman · Sunnyvale, US

Key dates

Filing date	Sep 19, 2022
Grant date	Oct 15, 2024
Priority date	—
Expiry date	May 31, 2043

Classification

Technology area (CPC G)	Physics
CPC primary	G06N3/048
WIPO field	Computer technology
WIPO sector	Electrical engineering

Abstract

A method includes receiving a sequence of acoustic frames and generating, by a first encoder, a first higher order feature representation for a corresponding acoustic frame in the sequence of acoustic frames. The method also includes generating, by a first pass transducer decoder, a first pass speech recognition hypothesis for a corresponding first higher order feature representation and generating, by a text encoder, a text encoding for a corresponding first pass speech recognition hypothesis. The method also includes generating, by a second encoder, a second higher order feature representation for a corresponding first higher order feature representation. The method also includes generating, by a second pass transducer decoder, a second pass speech recognition hypothesis using a corresponding second higher order feature representation and a corresponding text encoding.
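The data flow recited in the abstract can be sketched as follows. This is an illustrative sketch only, not the patented implementation: the class name `CascadedDeliberationSketch`, the method `run`, and the toy lambdas in `toy_pipeline` are all hypothetical placeholders, and processing each frame in isolation is a simplifying assumption (a practical encoder would normally use surrounding context). Only the wiring between the five components follows the abstract.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Sequence

Vector = List[float]


@dataclass
class CascadedDeliberationSketch:
    """Hypothetical wiring of the five components named in the abstract."""
    first_encoder: Callable[[Vector], Vector]
    first_pass_decoder: Callable[[Vector], str]
    text_encoder: Callable[[str], Vector]
    second_encoder: Callable[[Vector], Vector]
    second_pass_decoder: Callable[[Vector, Vector], str]

    def run(self, frames: Sequence[Vector]) -> List[Dict[str, str]]:
        results = []
        for frame in frames:
            # First encoder: acoustic frame -> first higher order representation.
            h1 = self.first_encoder(frame)
            # First pass transducer decoder: first representation -> hypothesis.
            hyp1 = self.first_pass_decoder(h1)
            # Text encoder: first pass hypothesis -> text encoding.
            text_enc = self.text_encoder(hyp1)
            # Second (cascaded) encoder consumes the first encoder's output.
            h2 = self.second_encoder(h1)
            # Second pass decoder uses both the second representation
            # and the text encoding of the first pass hypothesis.
            hyp2 = self.second_pass_decoder(h2, text_enc)
            results.append({"first_pass": hyp1, "second_pass": hyp2})
        return results


def toy_pipeline() -> CascadedDeliberationSketch:
    """Toy stand-ins (assumptions) so the wiring can be executed end to end."""
    return CascadedDeliberationSketch(
        first_encoder=lambda f: [2.0 * x for x in f],
        first_pass_decoder=lambda h: "a" if sum(h) > 0 else "b",
        text_encoder=lambda hyp: [float(ord(c)) for c in hyp],
        second_encoder=lambda h: [x + 1.0 for x in h],
        second_pass_decoder=lambda h2, t: (
            chr(int(t[0])).upper() if sum(h2) > 0 else chr(int(t[0]))
        ),
    )


if __name__ == "__main__":
    for step in toy_pipeline().run([[1.0, 2.0], [-3.0, -1.0]]):
        print(step)
```

Passing the components in as callables keeps the sketch independent of any particular model library; in a real system each callable would be a trained neural network module rather than a lambda.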

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.