Patent · US Active

Generating audio using auto-regressive generative neural networks

US11915689B1 · kind B1 · utility

Cited by: 1
References: 6
Claims: 30
Family size: 0

Assignee

Inventors

Key dates

Filing date: Sep 7, 2023
Grant date: Feb 27, 2024
Priority date
Expiry date: Sep 7, 2043

Classification

  • Technology area (CPC G): Physics
  • CPC primary: G10L21/0272
  • WIPO field: Computer technology
  • WIPO sector: Electrical engineering

Abstract

Methods, systems, and apparatus, including computer programs encoded on computer storage media, for generating a prediction of an audio signal. One of the methods includes receiving a request to generate an audio signal conditioned on an input; processing the input using an embedding neural network to map the input to one or more embedding tokens; generating a semantic representation of the audio signal; generating, using one or more generative neural networks and conditioned on at least the semantic representation and the embedding tokens, an acoustic representation of the audio signal; and processing at least the acoustic representation using a decoder neural network to generate the prediction of the audio signal.
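
The data flow described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: every function name, the token vocabulary size, the sequence lengths, and the toy arithmetic "models" are hypothetical stand-ins for the embedding, generative, and decoder neural networks. The sketch also assumes that the semantic stage is conditioned on the embedding tokens, which the abstract does not state. It shows only the order of the stages and the auto-regressive pattern in which each new token depends on the conditioning context and on the tokens generated so far.

```python
import random
from typing import Callable, List

VOCAB_SIZE = 64  # hypothetical token vocabulary size


def embed_input(request_input: str) -> List[int]:
    """Stand-in for the embedding neural network: input -> embedding tokens."""
    return [ord(ch) % VOCAB_SIZE for ch in request_input]


def make_toy_step(seed: int) -> Callable[[List[int], List[int]], int]:
    """Build a toy next-token function in place of a generative neural network."""
    rng = random.Random(seed)

    def step(context: List[int], prefix: List[int]) -> int:
        # The next token depends on the conditioning context and on all
        # previously generated tokens, which is what makes it auto-regressive.
        return (sum(context) + sum(prefix) + rng.randrange(VOCAB_SIZE)) % VOCAB_SIZE

    return step


def autoregressive_generate(step, context: List[int], length: int) -> List[int]:
    """Generate `length` tokens one at a time, feeding each back as prefix."""
    tokens: List[int] = []
    for _ in range(length):
        tokens.append(step(context, tokens))
    return tokens


def decode(acoustic_tokens: List[int], samples_per_token: int = 4) -> List[float]:
    """Stand-in for the decoder neural network: acoustic tokens -> waveform."""
    waveform: List[float] = []
    for token in acoustic_tokens:
        amplitude = 2.0 * token / (VOCAB_SIZE - 1) - 1.0  # map to [-1, 1]
        waveform.extend([amplitude] * samples_per_token)
    return waveform


def generate_audio(request_input: str, n_semantic: int = 8, n_acoustic: int = 16) -> List[float]:
    """Run the stages in the order given in the abstract."""
    embedding_tokens = embed_input(request_input)
    # Assumption: the semantic stage is conditioned on the embedding tokens.
    semantic = autoregressive_generate(make_toy_step(0), embedding_tokens, n_semantic)
    # The acoustic stage is conditioned on the semantic representation and the
    # embedding tokens, as the abstract describes.
    acoustic = autoregressive_generate(make_toy_step(1), semantic + embedding_tokens, n_acoustic)
    return decode(acoustic)


if __name__ == "__main__":
    audio = generate_audio("a dog barking")
    print(len(audio), min(audio), max(audio))
```

Splitting generation into a semantic stage followed by an acoustic stage means the second stage receives both the coarse representation and the embedding tokens as context; in the sketch this is simply list concatenation, whereas a real system would use learned conditioning.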

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.