Patent · US Active

ReLU compression to reduce GPU memory

US11362670B2 · kind B2 · utility

1Cited by
0References
20Claims
0Family size

Assignee

Inventors

Key dates

Filing dateOct 30, 2020
Grant dateJun 14, 2022
Priority date
Expiry dateFeb 10, 2041

Classification

  • Technology area (CPC H)Electricity
  • CPC primaryH03M7/6023
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

A method is presented for compressing data of a Rectified Linear Unit (ReLU) function on a graphical processing unit (GPU) employed in a learning process of a deep neural network. The method includes converting an initial data structure including nonzero data and zero data into a compressed data structure including only the nonzero data of the initial data structure as compressed data by generating a nonzero data bitmap region, generating a nonzero data number table region by employing a parallel reduction algorithm, calculating a nonzero data array index per block region of all blocks from the nonzero data number table region by employing a parallel prefix sum scan algorithm, allocating a buffer for the compressed data; and copying the nonzero data from the initial data structure into a nonzero data array region in a compressed data format in parallel.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.