Patent · US Active

Method and apparatus for quantizing deep neural network

US12099915B2 · kind B2 · utility

1Cited by
0References
11Claims
0Family size

Assignee

Inventor

Key dates

Filing dateApr 13, 2022
Grant dateSep 24, 2024
Priority date
Expiry dateApr 13, 2042

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06N3/08
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

A method for quantizing a deep neural network is provided, which includes extracting first statistical information on output values of a first normalization layer included in the deep neural network, determining a discretization interval associated with input values of a subsequent layer of the first normalization layer by using the extracted first statistical information, and quantizing the input values of the subsequent layer into discretized values having the determined discretization interval.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.