Patent · US Active

Speech recognition using convolutional neural networks

US10586531B2 · kind B2 · utility

6Cited by

10References

21Claims

0Family size

Assignee

DeepMind Technologies Limited · GB

Inventors

Aaron Gerard Antonius van den Oord · London, GB
Sander Etienne Lea Dieleman · London, GB
Nal Emmerich Kalchbrenner · London, GB
Karen Simonyan · London, GB
Oriol Vinyals · London, GB
Lasse Espeholt · Amsterdam, NL

Key dates

Filing date	Dec 4, 2018
Grant date	Mar 10, 2020
Priority date	—
Expiry date	Dec 4, 2038

Classification

Technology area (CPC G)Physics
CPC primaryG10L2015/025
WIPO fieldComputer technology
WIPO sectorElectrical engineering

Abstract

Methods, systems, and apparatus, including computer programs encoded on computer storage media, for performing speech recognition by generating a neural network output from an audio data input sequence, where the neural network output characterizes words spoken in the audio data input sequence. One of the methods includes, for each of the audio data inputs, providing a current audio data input sequence that comprises the audio data input and the audio data inputs preceding the audio data input in the audio data input sequence to a convolutional subnetwork comprising a plurality of dilated convolutional neural network layers, wherein the convolutional subnetwork is configured to, for each of the plurality of audio data inputs: receive the current audio data input sequence for the audio data input, and process the current audio data input sequence to generate an alternative representation for the audio data input.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.