Patent · US Active

Method and apparatus for multi-lingual end-to-end speech recognition

US10593321B2 · kind B2 · utility

Cited by	4
References	1
Claims	18
Family size	0

Assignee

Mitsubishi Electric Research Laboratories, Inc. · US

Inventors

Shinji Watanabe · Minato, JP
Takaaki Hori · Lexington, US
Hiroshi Seki · Hitachi, JP
Jonathan Le Roux · Somerville, US
John R. Hershey · Winchester, US

Key dates

Filing date	Dec 15, 2017
Grant date	Mar 17, 2020
Priority date	—
Expiry date	Dec 23, 2037

Classification

Technology area (CPC G)	Physics
CPC primary	G10L15/005
WIPO field	Computer technology
WIPO sector	Electrical engineering

Abstract

A method for training a multi-language speech recognition network includes providing utterance datasets corresponding to predetermined languages, inserting language identification (ID) labels into the utterance datasets, wherein each of the utterance datasets is labelled by each of the language ID labels, concatenating the labeled utterance datasets, generating initial network parameters from the utterance datasets, selecting the initial network parameters according to a predetermined sequence, and training, iteratively, an end-to-end network with a series of the selected initial network parameters and the concatenated labeled utterance datasets until a training result reaches a threshold.
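The data-preparation and stopping steps named in the abstract can be illustrated with a minimal Python sketch. This is not the patented implementation. The function names, the bracketed token format such as "[EN]", placing the label at the start of each transcript, and the character-level transcripts are all assumptions made for illustration. The generation and selection of initial network parameters are omitted, and the training step is a hypothetical callable supplied by the caller.

```python
def label_utterances(dataset, lang_id):
    """Prefix each transcript with a language ID token (assumed format "[EN]").

    `dataset` is a list of (audio_reference, transcript_string) pairs.
    """
    token = f"[{lang_id}]"
    return [(audio, [token] + list(text)) for audio, text in dataset]


def build_training_pool(datasets_by_lang):
    """Label every per-language dataset, then concatenate them into one pool."""
    pool = []
    for lang_id, dataset in datasets_by_lang.items():
        pool.extend(label_utterances(dataset, lang_id))
    return pool


def train_until_threshold(pool, train_step, threshold, max_iters=1000):
    """Call `train_step(pool)` repeatedly until its result is at or below `threshold`.

    `train_step` is a hypothetical callable that runs one training pass and
    returns a scalar result such as a loss. Returns (iterations_run, last_result).
    """
    result = float("inf")
    for iteration in range(1, max_iters + 1):
        result = train_step(pool)
        if result <= threshold:
            return iteration, result
    return max_iters, result


if __name__ == "__main__":
    datasets = {
        "EN": [("en_utt1.wav", "hello")],
        "JP": [("jp_utt1.wav", "こんにちは")],
    }
    pool = build_training_pool(datasets)
    for audio, labels in pool:
        print(audio, labels)
```

Because every target sequence in the pool carries its own language token, one network trained on the pool can learn to emit the language ID together with the transcript, which is the purpose of labeling before concatenation.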

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.