Patent · US Active

Language-agnostic multilingual modeling using effective script normalization

US11615779B2 · kind B2 · utility

0Cited by

4References

26Claims

0Family size

Assignee

Google LLC · US

Inventors

Arindrima Datta · New York, US
Bhuvana Ramabhadran · Campion Road, US
Jesse Emond · Mountain View, US
Brian E. Roark · Portland, US

Key dates

Filing date	Jan 19, 2021
Grant date	Mar 28, 2023
Priority date	—
Expiry date	Mar 12, 2041

Classification

Technology area (CPC G)Physics
CPC primaryG06F40/53
WIPO fieldComputer technology
WIPO sectorElectrical engineering

Abstract

A method includes obtaining a plurality of training data sets each associated with a respective native language and includes a plurality of respective training data samples. For each respective training data sample of each training data set in the respective native language, the method includes transliterating the corresponding transcription in the respective native script into corresponding transliterated text representing the respective native language of the corresponding audio in a target script and associating the corresponding transliterated text in the target script with the corresponding audio in the respective native language to generate a respective normalized training data sample. The method also includes training, using the normalized training data samples, a multilingual end-to-end speech recognition model to predict speech recognition results in the target script for corresponding speech utterances spoken in any of the different native languages associated with the plurality of training data sets.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.