Patent · US Active

Generating audio using auto-regressive generative neural networks

US12322380B2 · kind B2 · utility

1Cited by

6References

20Claims

0Family size

Assignee

Google LLC · US

Inventors

Andrea Agostinelli · Zürich, CH
Timo Immanuel Denk · Berlin, DE
Antoine Caillon · Paris, FR
Neil Zeghidour · Paris, FR
Jesse Engel · Oakland, US
Mauro Verzetti · Stintenberger Straße, CH
Christian Frank · Zürich, CH
Zalán Borsos · Zürich, CH
Matthew Sharifi · Adliswil, CH
Adam Joseph Roberts · Durham, US
Marco Tagliasacchi · Lugano, CH

Key dates

Filing date	Jan 12, 2024
Grant date	Jun 3, 2025
Priority date	—
Expiry date	Jan 12, 2044

Classification

Technology area (CPC G)Physics
CPC primaryG10L21/0272
WIPO fieldComputer technology
WIPO sectorElectrical engineering

Abstract

Methods, systems, and apparatus, including computer programs encoded on computer storage media, for generating a prediction of an audio signal. One of the methods includes receiving a request to generate an audio signal conditioned on an input; processing the input using an embedding neural network to map the input to one or more embedding tokens; generating a semantic representation of the audio signal; generating, using one or more generative neural networks and conditioned on at least the semantic representation and the embedding tokens, an acoustic representation of the audio signal; and processing at least the acoustic representation using a decoder neural network to generate the prediction of the audio signal.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.