Patent · US Active

Multi-accent speech recognition

US10431206B2 · kind B2 · utility

1Cited by

0References

20Claims

0Family size

Assignee

Google LLC · US

Inventors

Hasim Sak · New York, US
Kanury Kanishka Rao · Santa Clara, US

Key dates

Filing date	Aug 22, 2016
Grant date	Oct 1, 2019
Priority date	—
Expiry date	Apr 20, 2037

Classification

Technology area (CPC G)Physics
CPC primaryG10L2015/025
WIPO fieldComputer technology
WIPO sectorElectrical engineering

Abstract

Methods, systems, and apparatus, including computer programs encoded on computer storage media for training a hierarchical recurrent neural network (HRNN) having a plurality of parameters on a plurality of training acoustic sequences to generate phoneme representations of received acoustic sequences. One method includes, for each of the received training acoustic sequences: processing the received acoustic sequence in accordance with current values of the parameters of the HRNN to generate a predicted grapheme representation of the received acoustic sequence; processing an intermediate output generated by an intermediate layer of the HRNN during the processing of the received acoustic sequence to generate one or more predicted phoneme representations of the received acoustic sequence; and adjusting the current values of the parameters of the HRNN based at (i) the predicted grapheme representation and (ii) the one or more predicted phoneme representations.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.