Speech processing using a recurrent neural network
US11205420B1 · kind B1 · utility
Assignee
Inventors
Key dates
| Filing date | Jun 10, 2019 |
| Grant date | Dec 21, 2021 |
| Priority date | — |
| Expiry date | Jan 2, 2040 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG10L2015/088
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
A system and method performs wakeword detection using a neural network model that includes a recurrent neural network (RNN) for processing variable-length wakewords. To prevent the model from being influenced by non-wakeword speech, multiple instances of the model are created to process audio data, and each instance is configured to use weights determined by training data. The model may instead or in addition be used to process the audio data only when a likelihood that the audio data corresponds to the wakeword is greater than a threshold. The model may process the audio data as represented by groups of acoustic feature vectors; computations for feature vectors common to different groups may be re-used.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.