Patent · US Active

Scheduling neural network computations based on memory capacity

US11461631B2 · kind B2 · utility

1Cited by
5References
18Claims
0Family size

Assignee

Inventors

Key dates

Filing dateMar 22, 2018
Grant dateOct 4, 2022
Priority date
Expiry dateMay 8, 2041

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06F12/0862
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

Disclosed herein are techniques for scheduling and executing multi-layer neural network computations for multiple contexts. In one embodiment, a method comprises determining a set of computation tasks to be executed, the set of computation tasks including a first computation task and a second computation task, as well as a third computation task and a fourth computation task to provide input data for the first and second computation tasks; determining a first execution batch comprising the first and second computation tasks; determining a second execution batch comprising at least the third computation task to be executed before the first execution batch; determining whether to include the fourth computation task in the second execution batch based on whether the memory device has sufficient capacity to hold input data and output data of both of the third and fourth computation; executing the second execution batch followed by the first execution batch.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.