Patent · US Active

Speech processing using a recurrent neural network

US11205420B1 · kind B1 · utility

10Cited by

0References

20Claims

0Family size

Assignee

AMAZON TECHNOLOGIES, INC. · US

Inventors

Gengshen Fu · Sharon, US
Thibaud Senechal · Cambridge, US
Shiv Naga Prasad Vitaladevuni · Cambridge, US
Michael James Rodehorst · Belmont, US
Varun Kumar Nagaraja · Hyattsville, US

Key dates

Filing date	Jun 10, 2019
Grant date	Dec 21, 2021
Priority date	—
Expiry date	Jan 2, 2040

Classification

Technology area (CPC G)Physics
CPC primaryG10L2015/088
WIPO fieldComputer technology
WIPO sectorElectrical engineering

Abstract

A system and method performs wakeword detection using a neural network model that includes a recurrent neural network (RNN) for processing variable-length wakewords. To prevent the model from being influenced by non-wakeword speech, multiple instances of the model are created to process audio data, and each instance is configured to use weights determined by training data. The model may instead or in addition be used to process the audio data only when a likelihood that the audio data corresponds to the wakeword is greater than a threshold. The model may process the audio data as represented by groups of acoustic feature vectors; computations for feature vectors common to different groups may be re-used.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.