Patent · US Active

Inverted projection for robust speech translation

US12406659B2 · kind B2 · utility

Cited by: 0
References: 1
Claims: 20
Family size: 0

Assignee

Inventors

Key dates

Filing date: Jul 7, 2022
Grant date: Sep 2, 2025
Priority date
Expiry date: Jun 4, 2043

Classification

  • Technology area (CPC G): Physics
  • CPC primary: G10L15/28
  • WIPO field: Computer technology
  • WIPO sector: Electrical engineering

Abstract

The technology provides an approach to train translation models that are robust to transcription errors and punctuation errors. The approach includes introducing errors from actual automatic speech recognition and automatic punctuation systems into the source side of the machine translation training data. A method for training a machine translation model includes performing automatic speech recognition on input source audio to generate a system transcript. The method aligns a human transcript of the source audio to the system transcript, including projecting system segmentation onto the human transcript. Then the method performs segment robustness training of a machine translation model according to the aligned human and system transcripts, and performs system robustness training of the machine translation model, e.g., by injecting token errors into training data.
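
The projection step in the abstract (mapping the system segmentation onto the human transcript) can be illustrated with a minimal sketch. This is not the patented implementation: it assumes whitespace tokens, uses Python's standard `difflib.SequenceMatcher` as a stand-in for whatever alignment the method actually uses, and the function names `map_boundary` and `project_segmentation` are hypothetical.

```python
from difflib import SequenceMatcher


def map_boundary(opcodes, p):
    """Map boundary position p in the system token stream to a human token position."""
    for _tag, i1, i2, j1, j2 in opcodes:
        if i1 <= p <= i2:
            if i2 == i1:  # pure insertion on the human side
                return j1
            # Inside a mismatched block, place the boundary proportionally
            # (an assumption made for this sketch).
            return j1 + round((p - i1) * (j2 - j1) / (i2 - i1))
    raise ValueError("boundary outside the aligned range")


def project_segmentation(human_tokens, system_segments):
    """Re-segment the human transcript at the system's segment boundaries."""
    system_tokens = [tok for seg in system_segments for tok in seg]
    # Token-level alignment between system (a) and human (b) transcripts.
    opcodes = SequenceMatcher(
        a=system_tokens, b=human_tokens, autojunk=False
    ).get_opcodes()

    projected, start, consumed = [], 0, 0
    for k, seg in enumerate(system_segments):
        consumed += len(seg)
        if k == len(system_segments) - 1:
            end = len(human_tokens)  # last segment takes all remaining tokens
        else:
            end = map_boundary(opcodes, consumed)
        projected.append(human_tokens[start:end])
        start = end
    return projected


# Illustrative data only: a human transcript and an error-prone system
# transcript that has been split into three segments.
human = "we want to recognize speech today it is not easy".split()
system_segments = [
    "we want to wreck a".split(),
    "nice beach today it is".split(),
    "not easy".split(),
]
projected = project_segmentation(human, system_segments)
for seg in projected:
    print(" ".join(seg))
```

With this toy input the human transcript is split into "we want to recognize", "speech today it is", and "not easy": the words stay clean while the segment breaks follow the system output, which is the kind of source-side segmentation mismatch the segment robustness training is meant to expose the model to.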

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.