Method and apparatus for quantizing deep neural network
US12099915B2 · kind B2 · utility
Assignee
Inventor
Key dates
| Filing date | Apr 13, 2022 |
| Grant date | Sep 24, 2024 |
| Priority date | — |
| Expiry date | Apr 13, 2042 |
Classification
- Technology area (CPC G): Physics
- CPC primary: G06N3/08
- WIPO field: Computer technology
- WIPO sector: Electrical engineering
Abstract
A method for quantizing a deep neural network is provided, which includes extracting first statistical information on output values of a first normalization layer included in the deep neural network, determining a discretization interval associated with input values of a subsequent layer of the first normalization layer by using the extracted first statistical information, and quantizing the input values of the subsequent layer into discretized values having the determined discretization interval.
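The three steps named in the abstract (extract statistics from a normalization layer's outputs, derive a discretization interval from them, quantize the next layer's inputs onto that grid) can be illustrated with a minimal NumPy sketch. The abstract does not say which statistics are used or how the interval is computed from them, so everything specific here is an assumption for illustration: the use of mean and standard deviation, the clipping multiplier `k`, the signed 8-bit uniform grid, and the helper names `extract_statistics`, `discretization_interval`, and `quantize`. This is not the claimed method itself.

```python
import numpy as np

def layer_norm(x, eps=1e-5):
    """Plain layer normalization over the last axis (no learned scale/shift)."""
    mean = x.mean(axis=-1, keepdims=True)
    var = x.var(axis=-1, keepdims=True)
    return (x - mean) / np.sqrt(var + eps)

def extract_statistics(norm_out):
    """Step 1 (assumed form): summarize normalization outputs by mean and std."""
    return {"mean": float(norm_out.mean()), "std": float(norm_out.std())}

def discretization_interval(stats, num_bits=8, k=3.0):
    """Step 2 (assumed rule): cover |mean| + k*std with a signed uniform grid."""
    clip = abs(stats["mean"]) + k * stats["std"]
    levels = 2 ** (num_bits - 1) - 1  # 127 positive levels for 8 bits
    return clip / levels

def quantize(x, step, num_bits=8):
    """Step 3: snap inputs to integer multiples of `step`, clipping the tails."""
    levels = 2 ** (num_bits - 1) - 1
    q = np.clip(np.round(x / step), -levels, levels)
    return q * step

# Hypothetical calibration batch: 64 samples with 128 features each.
rng = np.random.default_rng(0)
calib = layer_norm(rng.normal(size=(64, 128)))

stats = extract_statistics(calib)
step = discretization_interval(stats)
x_q = quantize(calib, step)  # values the subsequent layer would receive
```

Because the statistics come from a normalization layer, whose outputs have a predictable spread, the interval can be fixed once from calibration data and reused at inference time. For values inside the clipping range, the rounding error in this sketch is at most half of `step`.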
Source: USPTO / EPO open patent data. Objective bibliographic data and citation counts.