Patent · US Active

End-to-end streaming speech translation with neural transducer

US12277927B2 · kind B2 · utility

0Cited by

1References

20Claims

0Family size

Assignee

MICROSOFT TECHNOLOGY LICENSING, LLC · US

Inventors

Jinyu Li · Beijing, CN
Jian Xue · Suzhou, CN
Matthew Post · Ruston, US
Peidong Wang · Columbus, US

Key dates

Filing date	Mar 15, 2022
Grant date	Apr 15, 2025
Priority date	—
Expiry date	Feb 24, 2043

Classification

Technology area (CPC G)Physics
CPC primaryG10L15/22
WIPO fieldComputer technology
WIPO sectorElectrical engineering

Abstract

Systems and methods are provided for obtaining, training, and using an end-to-end AST model based on a neural transducer, the end-to-end AST model comprising at least (i) an acoustic encoder which is configured to receive and encode audio data, (ii) a prediction network which is integrated in a parallel model architecture with the acoustic encoder in the end-to-end AST model, and (iii) a joint layer which is integrated in series with the acoustic encoder and prediction network. The end-to-end AST model is configured to generate a transcription in the second language of input audio data in the first language such that the acoustic encoder learns a plurality of temporal processing paths.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.