Method for compressing neural network model and electronic apparatus for performing the same
US12198040B2 · kind B2 · utility
Assignee
Inventor
Key dates
| Filing date | May 5, 2023 |
| Grant date | Jan 14, 2025 |
| Priority date | — |
| Expiry date | May 22, 2043 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG06V10/764
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
A method for compressing a neural network model is disclosed. The method for compressing a neural network model includes receiving, at a processor of the electronic apparatus, an original model including a plurality of layers each including a plurality of filters, a compression ratio to be applied to the original model, and a metric for determining an importance of the plurality of filters, determining the importance of the plurality of filters using the metric, normalizing the importance of the plurality of filters layer by layer, and compressing the original model by removing at least one filter among the plurality of filters based on the normalized importance and the compression ratio.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.