Patent · US Active

Methods of encoding and decoding speech signal using neural network model recognizing sound sources, and encoding and decoding apparatuses for performing the same

US11664037B2 · kind B2 · utility

0Cited by

5References

4Claims

0Family size

Assignees

Inventors

Woo-taek LIM · Daejeon, KR
Seung Kwon BEACK · Seoul, KR
Jongmo SUNG · Daejeon, KR
Mi Suk LEE · Daejeon, KR
Tae Jin LEE · Daejeon, KR
Inseon JANG · Daejeon, KR
Minje Kim · Hwaseong-si, KR
Haici YANG · Bloomington, US

Key dates

Filing date	May 20, 2021
Grant date	May 30, 2023
Priority date	—
Expiry date	May 20, 2041

Classification

Technology area (CPC G)Physics
CPC primaryG10L25/30
WIPO fieldComputer technology
WIPO sectorElectrical engineering

Abstract

Methods of encoding and decoding a speech signal using a neural network model that recognizes sound sources, and encoding and decoding apparatuses for performing the methods are provided. A method of encoding a speech signal includes identifying an input signal for a plurality of sound sources; generating a latent signal by encoding the input signal; obtaining a plurality of sound source signals by separating the latent signal for each of the plurality of sound sources; determining a number of bits used for quantization of each of the plurality of sound source signals according to a type of each of the plurality of sound sources; quantizing each of the plurality of sound source signals based on the determined number of bits; and generating a bitstream by combining the plurality of quantized sound source signals.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.