Patent · US Active

Maximized memory throughput using cooperative thread arrays

US7925860B1 · kind B1 · utility

27Cited by
10References
16Claims
0Family size

Assignee

Inventors

Key dates

Filing dateMay 14, 2007
Grant dateApr 12, 2011
Priority date
Expiry dateDec 9, 2028

Classification

  • Technology area (CPC G)Physics
  • CPC primaryG06F9/3889
  • WIPO fieldComputer technology
  • WIPO sectorElectrical engineering

Abstract

In parallel processing devices, for streaming computations, processing of each data element of the stream may not be computationally intensive and thus processing may take relatively small amounts of time to compute as compared to memory accesses times required to read the stream and write the results. Therefore, memory throughput often limits the performance of the streaming computation. Generally stated, provided are methods for achieving improved, optimized, or ultimately, maximized memory throughput in such memory-throughput-limited streaming computations. Streaming computation performance is maximized by improving the aggregate memory throughput across the plurality of processing elements and threads. High aggregate memory throughput is achieved by balancing processing loads between threads and groups of threads and a hardware memory interface coupled to the parallel processing devices.

Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.