Patent · US Active

Allocating computations of a machine learning network in a machine learning accelerator

US11734605B2 · kind B2 · utility

Cited by: 1
References: 1
Claims: 24
Family size: 0

Assignee

Inventors

Key dates

Filing date: Apr 29, 2020
Grant date: Aug 22, 2023
Priority date
Expiry date: Aug 9, 2041

Classification

  • Technology area (CPC G): Physics
  • CPC primary: G06N3/082
  • WIPO field: Computer technology
  • WIPO sector: Electrical engineering

Abstract

A compiler receives a description of a machine learning network and generates a computer program that implements the machine learning network. The compiler allocates instructions of the computer program to different groups of processing elements (Tiles) for execution such that different groups of Tiles implement different layers of the machine learning network. The compiler may determine the size of the different groups based on a partial computation metric associated with the computations performed to implement the corresponding layer. Furthermore, the compiler may assign specific Tiles to each group based on a set of predefined layout constraints. The compiler may statically schedule at least a portion of the instructions into one or more deterministic phases for execution by the groups of Tiles.
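
As a rough illustration of the allocation step described in the abstract, the sketch below sizes one group of Tiles per layer in proportion to a per-layer metric and then hands each group a contiguous run of Tile IDs. It is only a minimal sketch under stated assumptions, not the patented method: the abstract does not define the partial computation metric or the layout constraints, so the metric values, the proportional rounding rule, the "contiguous IDs" constraint, and the names `allocate_tiles` and `assign_contiguous` are all hypothetical.

```python
from typing import Dict, List


def allocate_tiles(metric: Dict[str, float], total_tiles: int) -> Dict[str, int]:
    """Split total_tiles across layers in proportion to each layer's metric.

    Assumption: every layer gets at least one Tile, and leftover Tiles go
    to the layers with the largest fractional remainders.
    """
    if total_tiles < len(metric):
        raise ValueError("need at least one Tile per layer")
    total = sum(metric.values())
    if total <= 0:
        raise ValueError("metric values must sum to a positive number")
    spare = total_tiles - len(metric)  # Tiles left after one per layer
    ideal = {name: spare * m / total for name, m in metric.items()}
    sizes = {name: 1 + int(share) for name, share in ideal.items()}
    leftover = total_tiles - sum(sizes.values())
    by_remainder = sorted(ideal, key=lambda n: ideal[n] - int(ideal[n]), reverse=True)
    for name in by_remainder[:leftover]:
        sizes[name] += 1
    return sizes


def assign_contiguous(sizes: Dict[str, int]) -> Dict[str, List[int]]:
    """Hypothetical layout constraint: each group is a contiguous ID range."""
    groups: Dict[str, List[int]] = {}
    next_id = 0
    for name, size in sizes.items():
        groups[name] = list(range(next_id, next_id + size))
        next_id += size
    return groups


# Made-up metric values for three layers; not taken from the patent.
layer_metric = {"conv1": 6.0, "conv2": 3.0, "fc": 1.0}
sizes = allocate_tiles(layer_metric, total_tiles=13)
groups = assign_contiguous(sizes)
print(sizes)
print(groups)
```

With these made-up inputs, the layer with the largest metric receives the largest group, and every Tile ID from 0 to 12 belongs to exactly one group. A real compiler would additionally have to schedule instructions within each group, which this sketch does not attempt.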

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.