Patent · US Active

Encoder-decoder models for sequence to sequence mapping

US10706840B2 · kind B2 · utility

9Cited by

52References

17Claims

0Family size

Assignee

Google LLC · US

Inventors

Hasim Sak · New York, US
Sean Matthew Shannon · Mountain View, US

Key dates

Filing date	Dec 19, 2017
Grant date	Jul 7, 2020
Priority date	—
Expiry date	Mar 9, 2038

Classification

Technology area (CPC G)Physics
CPC primaryG10L2015/025
WIPO fieldComputer technology
WIPO sectorElectrical engineering

Abstract

Methods, systems, and apparatus for performing speech recognition. In some implementations, acoustic data representing an utterance is obtained. The acoustic data corresponds to time steps in a series of time steps. One or more computers process scores indicative of the acoustic data using a recurrent neural network to generate a sequence of outputs. The sequence of outputs indicates a likely output label from among a predetermined set of output labels. The predetermined set of output labels includes output labels that respectively correspond to different linguistic units and to a placeholder label that does not represent a classification of acoustic data. The recurrent neural network is configured to use an output label indicated for a previous time step to determine an output label for the current time step. The generated sequence of outputs is processed to generate a transcription of the utterance, and the transcription of the utterance is provided.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.