Patent · US Active

Recognition of out-of-vocabulary in direct acoustics-to-word speech recognition using acoustic word embedding

US10839792B2 · kind B2 · utility

0Cited by
12References
25Claims
0Family size

Assignees

Inventors

Key dates

Filing dateFeb 5, 2019
Grant dateNov 17, 2020
Priority date
Expiry dateJun 12, 2039

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG10L2015/088
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

A method (and structure and computer product) for learning Out-of-Vocabulary (OOV) words in an Automatic Speech Recognition (ASR) system includes using an Acoustic Word Embedding Recurrent Neural Network (AWE RNN) to receive a character sequence for a new OOV word for the ASR system, the RNN providing an Acoustic Word Embedding (AWE) vector as an output thereof. The AWE vector output from the AWE RNN is provided as an input into an Acoustic Word Embedding-to-Acoustic-to-Word Neural Network (AWE→A2W NN) trained to provide an OOV word weight value from the AWE vector. The OOV word weight is inserted into a listing of Acoustic-to-Word (A2W) word embeddings used by the ASR system to output recognized words from an input of speech acoustic features, wherein the OOV word weight is inserted into the A2W word embeddings list relative to existing weights in the A2W word embeddings list.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.