Method and apparatus for compressing neural network model
US11861498B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Oct 18, 2022 |
| Grant date | Jan 2, 2024 |
| Priority date | — |
| Expiry date | Oct 18, 2042 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG06N3/0495
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
A method for compressing a neural network model includes acquiring a to-be-compressed neural network model. A first bit width, a second bit width and a target thinning rate corresponding to the to-be-compressed neural network model are determined. A target value is obtained according to the first bit width, the second bit width and the target thinning rate. Then the to-be-compressed neural network model is compressed using the target value, the first bit width and the second bit width to obtain a compression result of the to-be-compressed neural network model.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.