Method and apparatus for neural network quantization
US12393828B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Feb 9, 2024 |
| Grant date | Aug 19, 2025 |
| Priority date | — |
| Expiry date | Feb 9, 2044 |
Classification
- Technology area (CPC G): Physics
- CPC primary: G06N3/09
- WIPO field: Computer technology
- WIPO sector: Electrical engineering
Abstract
According to a method and apparatus for neural network quantization, a quantized neural network is generated by performing learning of a first neural network, obtaining weight differences between an initial weight and an updated weight determined by the learning of each cycle for each of the layers in the first neural network, analyzing a statistic of the weight differences for each of the layers, determining one or more layers, from among the layers, to be quantized with a lower-bit precision based on the analyzed statistic, and generating a second neural network by quantizing the determined one or more layers with the lower-bit precision.
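
The steps named in the abstract can be illustrated with a minimal Python sketch. The abstract does not specify the statistic, the selection rule, or the quantizer, so the sketch assumes the mean square of the weight differences as the statistic, selection of the layers with the smallest statistic, and symmetric uniform quantization. All function names, layer names, and toy values are hypothetical and are not taken from the patent.

```python
import numpy as np


def mean_square_diff(initial, updated):
    """Mean of squared weight differences for one layer in one learning cycle.

    Assumption: the abstract only says "a statistic"; mean square is one choice.
    """
    return float(np.mean((updated - initial) ** 2))


def layer_statistics(cycles):
    """Average the per-cycle statistic for every layer.

    `cycles` is a list of (initial_weights, updated_weights) pairs, one per
    learning cycle; each element of a pair maps layer name -> ndarray.
    """
    names = cycles[0][0].keys()
    return {
        name: float(np.mean([mean_square_diff(init[name], upd[name])
                             for init, upd in cycles]))
        for name in names
    }


def select_layers(stats, count):
    """Assumed rule: pick the `count` layers with the smallest statistic."""
    return sorted(stats, key=stats.get)[:count]


def quantize_uniform(weights, bits):
    """Symmetric uniform fake-quantization to `bits` bits (values stay float)."""
    levels = 2 ** (bits - 1) - 1
    max_abs = float(np.max(np.abs(weights)))
    if max_abs == 0.0:
        return weights.copy()
    scale = max_abs / levels
    return np.round(weights / scale) * scale


def quantize_network(weights, chosen, bits):
    """Build the second network: chosen layers quantized, the rest copied."""
    return {
        name: quantize_uniform(w, bits) if name in chosen else w.copy()
        for name, w in weights.items()
    }


# Toy data standing in for one learning cycle of a three-layer network.
initial = {
    "conv1": np.array([0.50, -0.25, 0.75]),
    "conv2": np.array([0.10, 0.20, -0.30]),
    "fc": np.array([1.00, -1.00, 0.50]),
}
updated = {
    "conv1": initial["conv1"] + 0.20,  # large change during learning
    "conv2": initial["conv2"] + 0.01,  # small change
    "fc": initial["fc"] + 0.05,        # moderate change
}

stats = layer_statistics([(initial, updated)])
chosen = select_layers(stats, count=2)
second_network = quantize_network(updated, chosen, bits=4)
```

With this toy data, `chosen` is `["conv2", "fc"]`, so those two layers are quantized to 4 bits while `conv1` keeps its original precision. A real implementation would take the weights from an actual training loop and could use a different statistic or threshold.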
Source: USPTO / EPO open patent data. Objective bibliographic data and citation counts.