Patent · US Active

Generating audio using auto-regressive generative neural networks

US11915689B1 · kind B1 · utility

1Cited by

6References

30Claims

0Family size

Assignee

Google LLC · US

Inventors

Andrea Agostinelli · Zürich, CH
Timo Immanuel Denk · Berlin, DE
Antoine Caillon · Paris, FR
Neil Zeghidour · Paris, FR
Jesse Engel · Oakland, US
Mauro Verzetti · Stintenberger Straße, CH
Christian Frank · Zürich, CH
Zalán Borsos · Zürich, CH
Matthew Sharifi · Adliswil, CH
Adam Joseph Roberts · Durham, US
Marco Tagliasacchi · Lugano, CH

Key dates

Filing date	Sep 7, 2023
Grant date	Feb 27, 2024
Priority date	—
Expiry date	Sep 7, 2043

Classification

Technology area (CPC G)Physics
CPC primaryG10L21/0272
WIPO fieldComputer technology
WIPO sectorElectrical engineering

Abstract

Methods, systems, and apparatus, including computer programs encoded on computer storage media, for generating a prediction of an audio signal. One of the methods includes receiving a request to generate an audio signal conditioned on an input; processing the input using an embedding neural network to map the input to one or more embedding tokens; generating a semantic representation of the audio signal; generating, using one or more generative neural networks and conditioned on at least the semantic representation and the embedding tokens, an acoustic representation of the audio signal; and processing at least the acoustic representation using a decoder neural network to generate the prediction of the audio signal.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.