Maximizing resource utilization of neural network computing system
US11609792B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Mar 19, 2019 |
| Grant date | Mar 21, 2023 |
| Priority date | — |
| Expiry date | Dec 23, 2041 |
Classification
- Technology area (CPC G)Physics
- CPC primaryG06N3/063
- WIPO fieldComputer technology
- WIPO sectorElectrical engineering
Abstract
The present disclosure relates to a method for allocating resources of an accelerator to two or more neural networks for execution. The two or more neural networks may include a first neural network and a second neural network. The method comprises analyzing workloads of the first neural network and the second neural network, wherein the first neural network and second neural network each includes multiple computational layers, evaluating computational resources of the accelerator for executing each computational layer of the first and second neural networks, and scheduling computational resources of the accelerator to execute one computational layer of the multiple computation layers of the first neural network and to execute one or more computational layers of the multiple computational layers of the second neural network.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.