Recognition of out-of-vocabulary in direct acoustics-to-word speech recognition using acoustic word embedding
US10839792B2 · kind B2 · utility
Assignees
Inventors
Key dates
| Filing date | Feb 5, 2019 |
| Grant date | Nov 17, 2020 |
| Priority date | — |
| Expiry date | Jun 12, 2039 |
Classification
- Technology area (CPC G): Physics
- CPC primary: G10L2015/088
- WIPO field: Computer technology
- WIPO sector: Electrical engineering
Abstract
A method (and structure and computer product) for learning Out-of-Vocabulary (OOV) words in an Automatic Speech Recognition (ASR) system includes using an Acoustic Word Embedding Recurrent Neural Network (AWE RNN) to receive a character sequence for a new OOV word for the ASR system, the RNN providing an Acoustic Word Embedding (AWE) vector as an output thereof. The AWE vector output from the AWE RNN is provided as an input into an Acoustic Word Embedding-to-Acoustic-to-Word Neural Network (AWE→A2W NN) trained to provide an OOV word weight value from the AWE vector. The OOV word weight is inserted into a listing of Acoustic-to-Word (A2W) word embeddings used by the ASR system to output recognized words from an input of speech acoustic features, wherein the OOV word weight is inserted into the A2W word embeddings list relative to existing weights in the A2W word embeddings list.
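The data flow described in the abstract can be sketched roughly as follows. This is a minimal illustration, not the patented implementation: both networks use untrained random weights, the AWE→A2W network is reduced to a single linear layer, and the new weight is simply appended to the end of the list. All names and sizes (`AweRnn`, `AweToA2w`, `add_oov_word`, `EMBED_DIM`, `A2W_DIM`, `ALPHABET`) are hypothetical and chosen only for this example.

```python
import math
import random

EMBED_DIM = 8   # hypothetical size of the AWE vector
A2W_DIM = 6     # hypothetical size of each A2W word embedding (weight vector)
ALPHABET = "abcdefghijklmnopqrstuvwxyz'"


def random_matrix(rows, cols, rng):
    """Small random matrix standing in for trained parameters."""
    return [[rng.uniform(-0.1, 0.1) for _ in range(cols)] for _ in range(rows)]


def matvec(matrix, vector):
    """Plain matrix-vector product."""
    return [sum(w * x for w, x in zip(row, vector)) for row in matrix]


class AweRnn:
    """Stand-in for the AWE RNN: character sequence -> AWE vector."""

    def __init__(self, rng):
        self.w_in = random_matrix(EMBED_DIM, len(ALPHABET), rng)
        self.w_rec = random_matrix(EMBED_DIM, EMBED_DIM, rng)

    def embed(self, word):
        hidden = [0.0] * EMBED_DIM
        for ch in word.lower():
            one_hot = [1.0 if c == ch else 0.0 for c in ALPHABET]
            from_input = matvec(self.w_in, one_hot)
            from_state = matvec(self.w_rec, hidden)
            hidden = [math.tanh(a + b) for a, b in zip(from_input, from_state)]
        return hidden  # final hidden state serves as the AWE vector


class AweToA2w:
    """Stand-in for the AWE->A2W NN: AWE vector -> OOV word weight."""

    def __init__(self, rng):
        self.w = random_matrix(A2W_DIM, EMBED_DIM, rng)

    def project(self, awe):
        return matvec(self.w, awe)


def add_oov_word(word, vocab, a2w_embeddings, awe_rnn, mapper):
    """Compute a weight for a new word and add it to the A2W embeddings list.

    Assumption: the new entry is appended after the existing weights.
    Returns the index of the word in the vocabulary.
    """
    if word in vocab:
        return vocab.index(word)
    awe = awe_rnn.embed(word)        # character sequence -> AWE vector
    weight = mapper.project(awe)     # AWE vector -> OOV word weight
    vocab.append(word)
    a2w_embeddings.append(weight)
    return len(vocab) - 1


rng = random.Random(0)
vocab = ["hello", "world"]
a2w_embeddings = random_matrix(len(vocab), A2W_DIM, rng)
awe_rnn = AweRnn(rng)
mapper = AweToA2w(rng)
index = add_oov_word("zyzzyva", vocab, a2w_embeddings, awe_rnn, mapper)
```

In this sketch the vocabulary and the embeddings list grow together, so the recognizer's output layer would gain one entry per added word without retraining on audio of that word. Where exactly the weight is placed relative to the existing entries is left open by the abstract; appending is only the simplest choice.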
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.