Patent · US Active

Speech recognition with acoustic models

US9818410B2 · kind B2 · utility

7Cited by

6References

20Claims

0Family size

Assignee

Google LLC · US

Inventors

Hasim Sak · New York, US
Andrew W. Senior · New York, US

Key dates

Filing date	Dec 29, 2015
Grant date	Nov 14, 2017
Priority date	—
Expiry date	Dec 29, 2035

Classification

Technology area (CPC G)Physics
CPC primaryG10L2015/025
WIPO fieldComputer technology
WIPO sectorElectrical engineering

Abstract

Methods, systems, and apparatus, including computer programs encoded on computer storage media for learning pronunciations from acoustic sequences. One method includes receiving an acoustic sequence, the acoustic sequence representing an utterance, and the acoustic sequence comprising a sequence of multiple frames of acoustic data at each of a plurality of time steps; stacking one or more frames of acoustic data to generate a sequence of modified frames of acoustic data; processing the sequence of modified frames of acoustic data through an acoustic modeling neural network comprising one or more recurrent neural network (RNN) layers and a final CTC output layer to generate a neural network output, wherein processing the sequence of modified frames of acoustic data comprises: subsampling the modified frames of acoustic data; and processing each subsampled modified frame of acoustic data through the acoustic modeling neural network.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.