Patent · US Active

Audio signal encoding and decoding method using a neural network model to generate a quantized latent vector, and encoder and decoder for performing the same

US12205605B2 · kind B2 · utility

0Cited by
4References
13Claims
0Family size

Assignees

Inventors

Key dates

Filing dateFeb 11, 2022
Grant dateJan 21, 2025
Priority date
Expiry dateOct 10, 2042

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG10L2019/0005
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

An audio signal encoding and decoding method using a neural network model, and an encoder and decoder for performing the same are disclosed. A method of encoding an audio signal using a neural network model, the method may include identifying an input signal, generating a quantized latent vector by inputting the input signal into a neural network model encoding the input signal, and generating a bitstream corresponding to the quantized latent vector, wherein the neural network model may include i) a feature extraction layer generating a latent vector by extracting a feature of the input signal, ii) a plurality of downsampling blocks downsampling the latent vector, and iii) a plurality of quantization blocks performing quantization of a downsampled latent vector.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.