Patent · US Active

Recognition of out-of-vocabulary in direct acoustics-to-word speech recognition using acoustic word embedding

US10839792B2 · kind B2 · utility

0Cited by

12References

25Claims

0Family size

Assignees

Inventors

Kartik Audhkhasi · White Plains, US
Karen Livescu · Yorktown Heights, US
Michael A. Picheny · White Plains, US
Shane Settle · Moraga, US

Key dates

Filing date	Feb 5, 2019
Grant date	Nov 17, 2020
Priority date	—
Expiry date	Jun 12, 2039

Classification

Technology area (CPC G)Physics
CPC primaryG10L2015/088
WIPO fieldComputer technology
WIPO sectorElectrical engineering

Abstract

A method (and structure and computer product) for learning Out-of-Vocabulary (OOV) words in an Automatic Speech Recognition (ASR) system includes using an Acoustic Word Embedding Recurrent Neural Network (AWE RNN) to receive a character sequence for a new OOV word for the ASR system, the RNN providing an Acoustic Word Embedding (AWE) vector as an output thereof. The AWE vector output from the AWE RNN is provided as an input into an Acoustic Word Embedding-to-Acoustic-to-Word Neural Network (AWE→A2W NN) trained to provide an OOV word weight value from the AWE vector. The OOV word weight is inserted into a listing of Acoustic-to-Word (A2W) word embeddings used by the ASR system to output recognized words from an input of speech acoustic features, wherein the OOV word weight is inserted into the A2W word embeddings list relative to existing weights in the A2W word embeddings list.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.