Generating audio using auto-regressive generative neural networks
US12322380B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Jan 12, 2024 |
| Grant date | Jun 3, 2025 |
| Priority date | — |
| Expiry date | Jan 12, 2044 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG10L21/0272
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
Methods, systems, and apparatus, including computer programs encoded on computer storage media, for generating a prediction of an audio signal. One of the methods includes receiving a request to generate an audio signal conditioned on an input; processing the input using an embedding neural network to map the input to one or more embedding tokens; generating a semantic representation of the audio signal; generating, using one or more generative neural networks and conditioned on at least the semantic representation and the embedding tokens, an acoustic representation of the audio signal; and processing at least the acoustic representation using a decoder neural network to generate the prediction of the audio signal.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.