Patent · US Active

Generating audio using neural networks

US10304477B2 · kind B2 · utility

10Cited by

7References

30Claims

0Family size

Assignee

DeepMind Technologies Limited · GB

Inventors

Aaron Gerard Antonius van den Oord · London, GB
Sander Etienne Lea Dieleman · London, GB
Nal Emmerich Kalchbrenner · London, GB
Karen Simonyan · London, GB
Oriol Vinyals · London, GB

Key dates

Filing date	Jul 9, 2018
Grant date	May 28, 2019
Priority date	—
Expiry date	Jul 9, 2038

Classification

Technology area (CPC G)Physics
CPC primaryG10H2250/311
WIPO fieldComputer technology
WIPO sectorElectrical engineering

Abstract

Methods, systems, and apparatus, including computer programs encoded on computer storage media, for generating an output sequence of audio data that comprises a respective audio sample at each of a plurality of time steps. One of the methods includes, for each of the time steps: providing a current sequence of audio data as input to a convolutional subnetwork, wherein the current sequence comprises the respective audio sample at each time step that precedes the time step in the output sequence, and wherein the convolutional subnetwork is configured to process the current sequence of audio data to generate an alternative representation for the time step; and providing the alternative representation for the time step as input to an output layer, wherein the output layer is configured to: process the alternative representation to generate an output that defines a score distribution over a plurality of possible audio samples for the time step.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.