End-to-end streaming speech translation with neural transducer
US12277927B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Mar 15, 2022 |
| Grant date | Apr 15, 2025 |
| Priority date | — |
| Expiry date | Feb 24, 2043 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG10L15/22
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
Systems and methods are provided for obtaining, training, and using an end-to-end AST model based on a neural transducer, the end-to-end AST model comprising at least (i) an acoustic encoder which is configured to receive and encode audio data, (ii) a prediction network which is integrated in a parallel model architecture with the acoustic encoder in the end-to-end AST model, and (iii) a joint layer which is integrated in series with the acoustic encoder and prediction network. The end-to-end AST model is configured to generate a transcription in the second language of input audio data in the first language such that the acoustic encoder learns a plurality of temporal processing paths.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.