Patent · US Active

Method for compressing neural network model and electronic apparatus for performing the same

US12198040B2 · kind B2 · utility

0Cited by

0References

20Claims

0Family size

Assignee

NOTA, INC. · KR

Inventor

Kyunghwan Shim · Daejeon, KR

Key dates

Filing date	May 5, 2023
Grant date	Jan 14, 2025
Priority date	—
Expiry date	May 22, 2043

Classification

Technology area (CPC G)Physics
CPC primaryG06V10/764
WIPO fieldComputer technology
WIPO sectorElectrical engineering

Abstract

A method for compressing a neural network model is disclosed. The method for compressing a neural network model includes receiving, at a processor of the electronic apparatus, an original model including a plurality of layers each including a plurality of filters, a compression ratio to be applied to the original model, and a metric for determining an importance of the plurality of filters, determining the importance of the plurality of filters using the metric, normalizing the importance of the plurality of filters layer by layer, and compressing the original model by removing at least one filter among the plurality of filters based on the normalized importance and the compression ratio.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.