Patent · US Active

Generating representations of acoustic sequences

US10134393B2 · kind B2 · utility

3Cited by

5References

20Claims

0Family size

Assignee

Google LLC · US

Inventors

Hasim Sak · New York, US
Andrew W. Senior · New York, US

Key dates

Filing date	Jul 31, 2017
Grant date	Nov 20, 2018
Priority date	—
Expiry date	Jul 31, 2037

Classification

Technology area (CPC G)Physics
CPC primaryG10L2015/025
WIPO fieldComputer technology
WIPO sectorElectrical engineering

Abstract

Methods, systems, and apparatus, including computer programs encoded on computer storage media, for generating representation of acoustic sequences. One of the methods includes: receiving an acoustic sequence, the acoustic sequence comprising a respective acoustic feature representation at each of a plurality of time steps; processing the acoustic feature representation at an initial time step using an acoustic modeling neural network; for each subsequent time step of the plurality of time steps: receiving an output generated by the acoustic modeling neural network for a preceding time step, generating a modified input from the output generated by the acoustic modeling neural network for the preceding time step and the acoustic representation for the time step, and processing the modified input using the acoustic modeling neural network to generate an output for the time step; and generating a phoneme representation for the utterance from the outputs for each of the time steps.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.