Patent · US Active

Context-dependent modeling of phonemes

US9818409B2 · kind B2 · utility

8Cited by

6References

17Claims

0Family size

Assignee

Google LLC · US

Inventors

Andrew W. Senior · New York, US
Hasim Sak · New York, US
Izhak Shafran · Menlo Park, US

Key dates

Filing date	Oct 7, 2015
Grant date	Nov 14, 2017
Priority date	—
Expiry date	Oct 7, 2035

Classification

Technology area (CPC G)Physics
CPC primaryG10L2015/025
WIPO fieldComputer technology
WIPO sectorElectrical engineering

Abstract

Methods, systems, and apparatus, including computer programs encoded on computer storage media for modeling phonemes. One method includes receiving an acoustic sequence, the acoustic sequence representing an utterance, and the acoustic sequence comprising a respective acoustic feature representation at each of a plurality of time steps; for each of the plurality of time steps: processing the acoustic feature representation through each of one or more recurrent neural network layers to generate a recurrent output; processing the recurrent output using a softmax output layer to generate a set of scores, the set of scores comprising a respective score for each of a plurality of context dependent vocabulary phonemes, the score for each context dependent vocabulary phoneme representing a likelihood that the context dependent vocabulary phoneme represents the utterance at the time step; and determining, from the scores for the plurality of time steps, a context dependent phoneme representation of the sequence.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.