Patent · US Active

Methods of optimization of computational graphs of neural networks

US11514324B2 · kind B2 · utility

Cited by: 0
References: 0
Claims: 12
Family size: 0

Inventors

Key dates

Filing date: Mar 29, 2019
Grant date: Nov 29, 2022
Priority date
Expiry date: Jul 1, 2041

Classification

  • Technology area (CPC G): Physics
  • CPC primary: G06N5/01
  • WIPO field: Computer technology
  • WIPO sector: Electrical engineering

Abstract

The present invention discloses a method for optimizing a neural network computational graph. A computational platform uses the computational graph to perform neural network calculations, reading the data those calculations need from off-chip memory. The method comprises: selecting layers that can be fused, based at least on an optimization rule for reducing the frequency of data exchange between the computational platform and the off-chip memory; and fusing at least two adjacent layers in the computational graph according to the selected layer objects. Here, the at least two adjacent layers are at least one of the following: horizontally adjacent layers that share the same input feature maps; and longitudinally adjacent layers in which the calculation results for a feature map of a previous layer are at least part of the input to a next layer. The optimization method of the present invention can be carried out automatically, either based on rules or through isomorphic subgraph matching. An optimal reconstruction mode for executing the computational graph is thus found, and the execution efficiency of the neural network computational platf…
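
The two kinds of adjacency named in the abstract can be illustrated with a minimal Python sketch. It is not the patented method: the toy graph, the layer names, and the helper functions `longitudinal_pairs` and `horizontal_groups` are all hypothetical, and the sketch only lists candidate layers. It does not apply any optimization rule, perform isomorphic subgraph matching, or carry out the fusion itself.

```python
from collections import defaultdict

# Hypothetical toy graph: each layer name maps to the names of its inputs.
# "input" is the external feature map and is not itself a layer.
GRAPH = {
    "conv1": ["input"],
    "relu1": ["conv1"],
    "branch_a": ["relu1"],
    "branch_b": ["relu1"],
    "concat": ["branch_a", "branch_b"],
}


def longitudinal_pairs(graph):
    """Return sorted (producer, consumer) pairs in which the consumer
    takes the producer's output as at least part of its input."""
    return sorted(
        (src, layer)
        for layer, inputs in graph.items()
        for src in inputs
        if src in graph  # skip external inputs that are not layers
    )


def horizontal_groups(graph):
    """Return sorted groups of two or more layers that read the same inputs."""
    by_inputs = defaultdict(list)
    for layer, inputs in graph.items():
        by_inputs[tuple(sorted(inputs))].append(layer)
    return sorted(sorted(group) for group in by_inputs.values() if len(group) > 1)


if __name__ == "__main__":
    print("longitudinal candidates:", longitudinal_pairs(GRAPH))
    print("horizontal candidates:", horizontal_groups(GRAPH))
```

In this toy graph, `branch_a` and `branch_b` both read `relu1`, so they form a horizontal candidate group, while pairs such as `conv1` and `relu1` are longitudinal candidates. A real optimizer would still have to decide which candidates to fuse, for example by estimating how much off-chip memory traffic each fusion saves.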

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.