Patent · US Active

Managing data sparsity for neural networks

US11392829B1 · kind B1 · utility

14Cited by
0References
20Claims
0Family size

Assignee

Inventors

Key dates

Filing dateApr 2, 2019
Grant dateJul 19, 2022
Priority date
Expiry dateJun 27, 2039

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06N3/084
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

Approaches in accordance with various embodiments provide for the processing of sparse matrices for mathematical and programmatic operations. In particular, various embodiments enforce sparsity constraints for performing sparse matrix multiply-add instruction (MMA) operations. Deep neural networks can exhibit significant sparsity in the data used in operations, both in the activations and weights. The computational load can be reduced by excluding zero-valued data elements. A sparsity constraint is applied across all submatrices of a sparse matrix, providing fine-grained structured sparsity that is evenly distributed across the matrix. The matrix may then be compressed since a minimum number of elements of the matrix are known to have zero value. Matrix operations are then performed using these matrices.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.