Patent · US Active

Per kernel Kmeans compression for neural networks

US11055604B2 · kind B2 · utility

Cited by: 0
References: 2
Claims: 10
Family size: 0

Assignee

Inventors

Key dates

Filing date: Sep 12, 2017
Grant date: Jul 6, 2021
Priority date
Expiry date: Apr 22, 2040

Classification

  • Technology area (CPC G): Physics
  • CPC primary: G06N3/088
  • WIPO field: Computer technology
  • WIPO sector: Electrical engineering

Abstract

Methods and apparatus relating to techniques for incremental network quantization. In an example, an apparatus comprises logic, at least partially comprising hardware logic, to determine a plurality of weights for a layer of a convolutional neural network (CNN) comprising a plurality of kernels; organize the plurality of weights into a plurality of clusters for the plurality of kernels; and apply a K-means compression algorithm to each of the plurality of clusters. Other embodiments are also disclosed and claimed.
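
The abstract describes the technique only at a high level, so the following is a minimal sketch of one possible reading, not the patented implementation. It assumes that the weights of a layer are grouped by kernel (the leading array axis), and that each group is compressed independently with scalar K-means into a small codebook plus per-weight indices. The function names (`kmeans_1d`, `compress_layer_per_kernel`, `decompress_layer`), the codebook size `k`, the evenly spaced initialization, and the tensor layout are all hypothetical choices made for illustration.

```python
import numpy as np


def kmeans_1d(values, k, iters=20):
    """Plain Lloyd's K-means on scalar values; returns (centroids, labels)."""
    values = np.asarray(values, dtype=np.float64)
    # Assumption: deterministic init, centroids spread evenly over the value range.
    centroids = np.linspace(values.min(), values.max(), k)
    for _ in range(iters):
        # Assign every value to its nearest centroid.
        labels = np.abs(values[:, None] - centroids[None, :]).argmin(axis=1)
        # Move each non-empty centroid to the mean of its members.
        for j in range(k):
            members = values[labels == j]
            if members.size:
                centroids[j] = members.mean()
    labels = np.abs(values[:, None] - centroids[None, :]).argmin(axis=1)
    return centroids, labels


def compress_layer_per_kernel(weights, k=4):
    """Run K-means separately on each kernel (axis 0) of a weight tensor.

    Returns one k-entry codebook per kernel and an index array with the
    same shape as `weights`. Assumes k <= 256 so indices fit in uint8.
    """
    num_kernels = weights.shape[0]
    codebooks = np.empty((num_kernels, k), dtype=weights.dtype)
    indices = np.empty(weights.shape, dtype=np.uint8)
    for n in range(num_kernels):
        centroids, labels = kmeans_1d(weights[n].ravel(), k)
        codebooks[n] = centroids
        indices[n] = labels.reshape(weights[n].shape)
    return codebooks, indices


def decompress_layer(codebooks, indices):
    """Rebuild an approximate weight tensor by codebook lookup per kernel."""
    return np.stack([codebooks[n][indices[n]] for n in range(len(codebooks))])


# Hypothetical layer: 8 kernels, each 3 input channels x 3 x 3.
rng = np.random.default_rng(0)
weights = rng.normal(size=(8, 3, 3, 3)).astype(np.float32)
codebooks, indices = compress_layer_per_kernel(weights, k=4)
restored = decompress_layer(codebooks, indices)
```

In this sketch each kernel stores `k` floating-point centroids and one small integer index per weight, so the storage saving depends on `k` and on the kernel size. Keeping a separate codebook per kernel lets the centroids follow each kernel's own weight distribution, at the cost of storing more codebooks than a single layer-wide clustering would need.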

Source: USPTO / EPO open patent data. Objective bibliographic data and citation counts.