Audio signal encoding and decoding method using a neural network model to generate a quantized latent vector, and encoder and decoder for performing the same
US12205605B2 · kind B2 · utility
Assignees
Inventors
Key dates
| Filing date | Feb 11, 2022 |
| Grant date | Jan 21, 2025 |
| Priority date | — |
| Expiry date | Oct 10, 2042 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG10L2019/0005
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
An audio signal encoding and decoding method using a neural network model, and an encoder and decoder for performing the same are disclosed. A method of encoding an audio signal using a neural network model, the method may include identifying an input signal, generating a quantized latent vector by inputting the input signal into a neural network model encoding the input signal, and generating a bitstream corresponding to the quantized latent vector, wherein the neural network model may include i) a feature extraction layer generating a latent vector by extracting a feature of the input signal, ii) a plurality of downsampling blocks downsampling the latent vector, and iii) a plurality of quantization blocks performing quantization of a downsampled latent vector.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.