Patent · US Active

Topology aware grouping and provisioning of GPU resources in GPU-as-a-Service platform

US10325343B1 · kind B1 · utility

Cited by: 22
References: 4
Claims: 20
Family size: 0

Assignee

Inventors

Key dates

Filing date: Aug 4, 2017
Grant date: Jun 18, 2019
Priority date
Expiry date: Oct 17, 2037

Classification

  • Technology area (CPC H): Electricity
  • CPC primary: H04L41/5009
  • WIPO field: Digital communication
  • WIPO sector: Electrical engineering

Abstract

Techniques are provided for implementing a graphics processing unit (GPU) service platform that is configured to provide topology aware grouping and provisioning of GPU resources for GPU-as-a-Service. A GPU server node receives a service request from a client system for GPU processing services provided by the GPU server node, wherein the GPU server node comprises a plurality of GPU devices. The GPU server node accesses a performance metrics data structure which comprises performance metrics associated with an interconnect topology of the GPU devices and hardware components of the GPU server node. The GPU server node dynamically forms a group of GPU devices of the GPU server node based on the performance metrics of the accessed data structure, and provisions the dynamically formed group of GPU devices to the client system to handle the service request.
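
The grouping step described in the abstract can be illustrated with a minimal Python sketch. It is not the patented method. It assumes the performance metrics data structure can be reduced to one score per GPU pair, keyed by interconnect type, and it picks the free GPUs whose pairwise links score highest in total. The link labels, the score values, and the names `LINK_SCORE`, `EXAMPLE_TOPOLOGY`, `group_score`, and `form_group` are all hypothetical and do not come from the patent.

```python
from itertools import combinations

# Assumed scores per interconnect type: higher means a faster GPU-to-GPU path.
# Labels and values are illustrative only, not taken from the patent.
LINK_SCORE = {
    "NVLINK": 10,
    "PCIE_SWITCH": 5,
    "CROSS_SOCKET": 1,
}

# Hypothetical 4-GPU server node: maps each GPU pair (low id, high id)
# to the type of link that connects the pair.
EXAMPLE_TOPOLOGY = {
    (0, 1): "NVLINK",
    (2, 3): "NVLINK",
    (0, 2): "PCIE_SWITCH",
    (1, 3): "PCIE_SWITCH",
    (0, 3): "CROSS_SOCKET",
    (1, 2): "CROSS_SOCKET",
}


def group_score(group, topology):
    """Sum the link scores over every pair of GPUs in the candidate group."""
    return sum(
        LINK_SCORE[topology[(min(a, b), max(a, b))]]
        for a, b in combinations(group, 2)
    )


def form_group(free_gpus, topology, count):
    """Return the best-connected group of `count` free GPUs, or None.

    Exhaustive search over all candidate groups; this is acceptable for the
    handful of GPUs in one server node but is not meant to scale further.
    Ties are resolved in favor of the lowest GPU ids.
    """
    if count < 1 or count > len(free_gpus):
        return None
    candidates = combinations(sorted(free_gpus), count)
    return max(candidates, key=lambda g: group_score(g, topology))


if __name__ == "__main__":
    # All four GPUs are free: the request for two GPUs gets an NVLink pair.
    print(form_group({0, 1, 2, 3}, EXAMPLE_TOPOLOGY, 2))
    # GPU 0 is already provisioned: the remaining NVLink pair is chosen.
    print(form_group({1, 2, 3}, EXAMPLE_TOPOLOGY, 2))
```

Because the group is computed per request from the GPUs that are free at that moment, the result changes as devices are provisioned and released, which is the sense in which the abstract calls the grouping dynamic. A real platform would presumably also weigh the other hardware components the abstract mentions; this sketch leaves them out.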

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.