Patent · US Active

Canonical training for highly configurable multilingual speech

US12249336B2 · kind B2 · utility

1Cited by
8References
20Claims
0Family size

Assignee

Inventors

Key dates

Filing dateJun 29, 2021
Grant dateMar 11, 2025
Priority date
Expiry dateJun 29, 2041

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG10L2015/0635
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

Embodiments are provided for building a configurable multilingual model. A computing system obtains a plurality of language-specific automatic speech recognition modules and a universal automatic speech recognition module trained on a multi-language training dataset comprising training data corresponding to each of the plurality of different languages. The computing system then compiles the universal automatic speech recognition module with the plurality of language-specific automatic speech recognition modules to generate a configurable multilingual model that is configured to selectively and dynamically utilize a sub-set of the plurality of language-specific automatic speech recognition modules with the universal automatic speech recognition module to process audio content in response to user input identifying one or more target languages associated with the audio content.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.