Patent · US Active

Heterogeneous ML accelerator cluster with flexible system resource balance

US12417047B2 · kind B2 · utility

0Cited by
5References
20Claims
0Family size

Assignee

Inventors

Key dates

Filing dateJan 10, 2023
Grant dateSep 16, 2025
Priority date
Expiry dateMar 15, 2043

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06F15/7896
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

Aspects of the disclosure are directed to a heterogeneous machine learning accelerator system with compute and memory nodes connected by high speed chip-to-chip interconnects. While existing remote/disaggregated memory may require memory expansion via remote processing units, aspects of the disclosure add memory nodes into machine learning accelerator clusters via the chip-to-chip interconnects without needing assistance from remote processing units to achieve higher performance, simpler software stack, and/or lower cost. The memory nodes may support prefetch and intelligent compression to enable the use of low cost memory without performance degradation.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.