Patent · US Active

Host-directed multi-layer neural network processing via per-layer work requests

US11429848B2 · kind B2 · utility

2Cited by
1References
14Claims
0Family size

Assignee

Inventors

Key dates

Filing dateOct 17, 2017
Grant dateAug 30, 2022
Priority date
Expiry dateMay 28, 2040

Classification

  • Technology area (CPC H)Electricity
  • CPC primaryH03K19/177
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

In disclosed approaches of neural network processing, a host computer system copies an input data matrix from host memory to a shared memory for performing neural network operations of a first layer of a neural network by a neural network accelerator. The host instructs the neural network accelerator to perform neural network operations of each layer of the neural network beginning with the input data matrix. The neural network accelerator performs neural network operations of each layer in response to the instruction from the host. The host waits until the neural network accelerator signals completion of performing neural network operations of layer i before instructing the neural network accelerator to commence performing neural network operations of layer i+1, for i≥1. The host instructs the neural network accelerator to use a results data matrix in the shared memory from layer i as an input data matrix for layer i+1 for i≥1.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.