Patent · US Active

Canonical training for highly configurable multilingual speech

US12249336B2 · kind B2 · utility

Cited by	1
References	8
Claims	20
Family size	0

Assignee

MICROSOFT TECHNOLOGY LICENSING, LLC · US

Inventors

Jinyu Li · Beijing, CN
Long Zhou · Shanghai, CN
Xie Sun · Florham Park, US
Shujie Liu · Cupertino, US

Key dates

Filing date	Jun 29, 2021
Grant date	Mar 11, 2025
Priority date	—
Expiry date	Jun 29, 2041

Classification

Technology area (CPC G)	Physics
CPC primary	G10L2015/0635
WIPO field	Computer technology
WIPO sector	Electrical engineering

Abstract

Embodiments are provided for building a configurable multilingual model. A computing system obtains a plurality of language-specific automatic speech recognition modules and a universal automatic speech recognition module trained on a multi-language training dataset comprising training data corresponding to each of the plurality of different languages. The computing system then compiles the universal automatic speech recognition module with the plurality of language-specific automatic speech recognition modules to generate a configurable multilingual model that is configured to selectively and dynamically utilize a sub-set of the plurality of language-specific automatic speech recognition modules with the universal automatic speech recognition module to process audio content in response to user input identifying one or more target languages associated with the audio content.
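The selection behavior described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation. The class name, the `process` method, the toy modules, and in particular the rule for combining outputs (universal scores plus the average of the selected language-specific scores) are assumptions made for illustration only; the abstract does not state how module outputs are combined.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Sequence

# Hypothetical types: a module maps input features to a list of scores.
Features = Sequence[float]
AsrModule = Callable[[Features], List[float]]


@dataclass
class ConfigurableMultilingualModel:
    """One universal module bundled with per-language modules (illustrative)."""
    universal: AsrModule
    language_specific: Dict[str, AsrModule]

    def process(self, features: Features, target_languages: Sequence[str]) -> List[float]:
        # Reject languages for which no language-specific module was compiled in.
        unknown = [lang for lang in target_languages if lang not in self.language_specific]
        if unknown:
            raise ValueError(f"no language-specific module for: {unknown}")

        # The universal module always runs.
        scores = list(self.universal(features))

        # Only the user-selected subset of language-specific modules runs.
        # ASSUMPTION: outputs are merged by adding the average of the selected
        # modules' scores; the source does not specify the combination rule.
        for lang in target_languages:
            specific = self.language_specific[lang](features)
            scores = [s + d / len(target_languages) for s, d in zip(scores, specific)]
        return scores


# Toy stand-ins for trained modules (hypothetical, for demonstration only).
model = ConfigurableMultilingualModel(
    universal=lambda x: [float(sum(x)), 0.0],
    language_specific={
        "en": lambda x: [1.0, 0.0],
        "de": lambda x: [0.0, 2.0],
    },
)

universal_only = model.process([1.0, 2.0], [])
english_only = model.process([1.0, 2.0], ["en"])
english_german = model.process([1.0, 2.0], ["en", "de"])
```

Passing an empty language list falls back to the universal module alone, while naming one or more target languages activates only the corresponding language-specific modules, so the same compiled model serves different user configurations without retraining.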

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.