Patent · US Active

Method for compressing neural network model and electronic apparatus for performing the same

US12198040B2 · kind B2 · utility

Cited by: 0
References: 0
Claims: 20
Family size: 0

Assignee

Inventor

Key dates

Filing date: May 5, 2023
Grant date: Jan 14, 2025
Priority date
Expiry date: May 22, 2043

Classification

  • Technology area (CPC G): Physics
  • CPC primary: G06V10/764
  • WIPO field: Computer technology
  • WIPO sector: Electrical engineering

Abstract

A method for compressing a neural network model is disclosed. The method for compressing a neural network model includes receiving, at a processor of the electronic apparatus, an original model including a plurality of layers each including a plurality of filters, a compression ratio to be applied to the original model, and a metric for determining an importance of the plurality of filters, determining the importance of the plurality of filters using the metric, normalizing the importance of the plurality of filters layer by layer, and compressing the original model by removing at least one filter among the plurality of filters based on the normalized importance and the compression ratio.
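
The steps named in the abstract (score each filter, normalize the scores layer by layer, remove filters according to the normalized scores and the compression ratio) can be illustrated with a minimal Python sketch. It is not the claimed implementation. The abstract does not specify the metric, the normalization scheme, or how the compression ratio maps to a filter count, so the choices below are assumptions made for illustration: an L1-norm metric, division by the per-layer maximum, a ratio read as the fraction of all filters to remove, and a guard that keeps at least one filter in every layer. All function and type names are hypothetical.

```python
from typing import Callable, Dict, List

Filter = List[float]              # one filter, flattened to a list of weights
Model = Dict[str, List[Filter]]   # layer name -> filters in that layer


def l1_norm(filt: Filter) -> float:
    """Assumed importance metric: sum of absolute weights."""
    return sum(abs(w) for w in filt)


def normalize_per_layer(scores: List[float]) -> List[float]:
    """Assumed normalization: divide one layer's scores by that layer's maximum."""
    peak = max(scores)
    if peak == 0:
        return [0.0 for _ in scores]
    return [s / peak for s in scores]


def compress(model: Model, ratio: float,
             metric: Callable[[Filter], float] = l1_norm) -> Model:
    """Remove roughly `ratio` of all filters, lowest normalized importance first."""
    if not 0.0 <= ratio < 1.0:
        raise ValueError("ratio must be in [0, 1)")

    # Score every filter with the metric, then normalize within each layer
    # so that scores from different layers are comparable.
    ranked = []
    for name, filters in model.items():
        scores = normalize_per_layer([metric(f) for f in filters])
        ranked.extend((score, name, idx) for idx, score in enumerate(scores))

    # Assumption: the compression ratio is the fraction of filters to remove.
    budget = int(len(ranked) * ratio)
    ranked.sort()  # ascending by normalized score, ties broken by layer and index

    remaining = {name: len(filters) for name, filters in model.items()}
    removed = set()
    for _score, name, idx in ranked:
        if len(removed) >= budget:
            break
        if remaining[name] > 1:  # assumed guard: never empty a layer
            removed.add((name, idx))
            remaining[name] -= 1

    return {
        name: [f for idx, f in enumerate(filters) if (name, idx) not in removed]
        for name, filters in model.items()
    }


model = {
    "conv1": [[0.1, -0.1], [2.0, 2.0], [1.0, -1.0]],
    "conv2": [[10.0, 10.0], [50.0, -50.0]],
}
pruned = compress(model, ratio=0.5)
print({name: len(filters) for name, filters in pruned.items()})
```

In this toy model the weights in `conv2` are much larger than those in `conv1`. Ranking by raw L1 norm would take both removed filters from `conv1`, whereas the per-layer normalization removes the weakest filter from each layer, which is the kind of cross-layer scale mismatch that layer-wise normalization is meant to address.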

Source: USPTO / EPO open patent data (bibliographic data and citation counts).