Maximized memory throughput using cooperative thread arrays
US7925860B1 · kind B1 · utility
Assignee
Inventors
Key dates
| Filing date | May 14, 2007 |
| Grant date | Apr 12, 2011 |
| Priority date | — |
| Expiry date | Dec 9, 2028 |
Classification
- Technology area (CPC G): Physics
- CPC primary: G06F9/3889
- WIPO field: Computer technology
- WIPO sector: Electrical engineering
Abstract
In parallel processing devices, for streaming computations, processing of each data element of the stream may not be computationally intensive, and thus processing may take a relatively small amount of time compared to the memory access times required to read the stream and write the results. Therefore, memory throughput often limits the performance of the streaming computation. Generally stated, provided are methods for achieving improved, optimized, or ultimately, maximized memory throughput in such memory-throughput-limited streaming computations. Streaming computation performance is maximized by improving the aggregate memory throughput across the plurality of processing elements and threads. High aggregate memory throughput is achieved by balancing processing loads between threads and groups of threads and a hardware memory interface coupled to the parallel processing devices.
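The abstract's premise, that a streaming computation can be limited by memory throughput rather than by arithmetic, can be illustrated with a rough back-of-the-envelope estimate. The sketch below is not taken from the patent: the function name `estimate_stream`, its parameters, and the assumption that memory traffic and computation overlap perfectly are all hypothetical, chosen only to show how the two costs compare.

```python
from dataclasses import dataclass


@dataclass
class StreamEstimate:
    """Rough cost split for one pass over a data stream (illustrative only)."""
    memory_seconds: float
    compute_seconds: float

    @property
    def total_seconds(self) -> float:
        # Assumption: memory traffic and computation overlap perfectly,
        # so the slower of the two determines the run time.
        return max(self.memory_seconds, self.compute_seconds)

    @property
    def memory_bound(self) -> bool:
        # True when moving the data takes at least as long as processing it.
        return self.memory_seconds >= self.compute_seconds


def estimate_stream(n_elements: int,
                    bytes_per_element: int,
                    ops_per_element: float,
                    bandwidth_bytes_per_s: float,
                    ops_per_s: float) -> StreamEstimate:
    """Estimate memory time and compute time for a streaming computation.

    Each element is assumed to be read once and its result written once,
    so the traffic is twice the stream size. All parameters are
    hypothetical inputs supplied by the caller.
    """
    bytes_moved = 2 * n_elements * bytes_per_element  # one read + one write
    memory_seconds = bytes_moved / bandwidth_bytes_per_s
    compute_seconds = n_elements * ops_per_element / ops_per_s
    return StreamEstimate(memory_seconds, compute_seconds)


if __name__ == "__main__":
    # Hypothetical device figures, not measurements of any real hardware.
    est = estimate_stream(
        n_elements=1_000_000,
        bytes_per_element=4,
        ops_per_element=1,
        bandwidth_bytes_per_s=1e9,
        ops_per_s=1e10,
    )
    print("memory time  (s):", est.memory_seconds)
    print("compute time (s):", est.compute_seconds)
    print("memory bound:", est.memory_bound)
```

Under these assumed figures the time spent moving data exceeds the time spent computing, which is the situation the abstract describes: in that regime, raising aggregate memory throughput shortens the run, while faster arithmetic does not.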
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.