Patent · US Active

End-to-end streaming speech translation with neural transducer

US12277927B2 · kind B2 · utility

0Cited by
1References
20Claims
0Family size

Assignee

Inventors

Key dates

Filing dateMar 15, 2022
Grant dateApr 15, 2025
Priority date
Expiry dateFeb 24, 2043

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG10L15/22
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

Systems and methods are provided for obtaining, training, and using an end-to-end AST model based on a neural transducer, the end-to-end AST model comprising at least (i) an acoustic encoder which is configured to receive and encode audio data, (ii) a prediction network which is integrated in a parallel model architecture with the acoustic encoder in the end-to-end AST model, and (iii) a joint layer which is integrated in series with the acoustic encoder and prediction network. The end-to-end AST model is configured to generate a transcription in the second language of input audio data in the first language such that the acoustic encoder learns a plurality of temporal processing paths.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.