Topology aware grouping and provisioning of GPU resources in GPU-as-a-Service platform
US10325343B1 · kind B1 · utility
Assignee
Inventors
Key dates
| Filing date | Aug 4, 2017 |
| Grant date | Jun 18, 2019 |
| Priority date | — |
| Expiry date | Oct 17, 2037 |
Classification
- Technology area (CPC H)Electricity
- CPC primaryH04L41/5009
- WIPO fieldDigital communication
- WIPO sectorElectrical engineering
Abstract
Techniques are provided for implementing a graphics processing unit (GPU) service platform that is configured to provide topology aware grouping and provisioning of GPU resources for GPU-as-a-Service. A GPU server node receives a service request from a client system for GPU processing services provided by the GPU server node, wherein the GPU server node comprises a plurality of GPU devices. The GPU server node accesses a performance metrics data structure which comprises performance metrics associated with an interconnect topology of the GPU devices and hardware components of the GPU sever node. The GPU server node dynamically forms a group of GPU devices of the GPU server node based on the performance metrics of the accessed data structure, and provisions the dynamically formed group of GPU devices to the client system to handle the service request.
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.