Patent · US Active

Low precision and coarse-to-fine dynamic fixed-point quantization design in convolution neural network

US12014273B2 · kind B2 · utility

1Cited by
0References
6Claims
0Family size

Assignee

Inventors

Key dates

Filing dateAug 27, 2019
Grant dateJun 18, 2024
Priority date
Expiry dateAug 16, 2042

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06N3/063
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

After inputting input data to a floating pre-trained convolution neural network to generate floating feature maps for each layer of the floating pre-trained CNN model, a statistical analysis on the floating feature maps is performed to generate a dynamic quantization range for each layer of the floating pre-trained CNN model. Based on the obtained quantization range for each layer, the proposed quantization methodologies quantize the floating pre-trained CNN model to generate the scalar factor of each layer and the fractional bit-width of a quantized CNN model. It enables the inference engine performs low-precision fixed-point arithmetic operations to generate a fixed-point inferred CNN model.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.