Maximized memory throughput on parallel processing devices
US8327123B2 · kind B2 · utility
Assignee
Inventors
Key dates
| Filing date | Mar 23, 2011 |
| Grant date | Dec 4, 2012 |
| Priority date | — |
| Expiry date | Mar 23, 2031 |
Classification
- Technology area (CPC G): Physics
- CPC primary: G06F9/3889
- WIPO field: Computer technology
- WIPO sector: Electrical engineering
Abstract
In streaming computations on parallel processing devices, processing each data element of the stream may not be computationally intensive, so the computation may take relatively little time compared with the memory access times required to read the stream and write the results. Memory throughput therefore often limits the performance of the streaming computation. Generally stated, provided are methods for achieving improved, optimized, or ultimately maximized memory throughput in such memory-throughput-limited streaming computations. Streaming computation performance is maximized by improving the aggregate memory throughput across the plurality of processing elements and threads. High aggregate memory throughput is achieved by balancing processing loads between threads and groups of threads and a hardware memory interface coupled to the parallel processing devices.
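The abstract does not say how the load balancing is carried out, so the following is only a minimal sketch of the general idea, not the patented method. It assumes a stream of `n_elements` items is split into contiguous per-thread ranges whose boundaries fall on multiples of a memory-transaction width. The function name `balanced_chunks` and the `align` parameter are hypothetical and do not come from the patent.

```python
def balanced_chunks(n_elements, n_threads, align=32):
    """Split [0, n_elements) into n_threads contiguous ranges.

    Illustrative assumption: `align` is a memory-transaction width in
    elements. Range boundaries fall on multiples of `align`, and the
    number of aligned blocks per thread differs by at most one.
    """
    if n_elements < 0 or n_threads <= 0 or align <= 0:
        raise ValueError("n_elements must be >= 0; n_threads and align must be > 0")

    # Number of aligned blocks needed to cover the stream (ceiling division).
    n_blocks = -(-n_elements // align)
    # Every thread gets `base` blocks; the first `extra` threads get one more.
    base, extra = divmod(n_blocks, n_threads)

    chunks = []
    start_block = 0
    for t in range(n_threads):
        blocks = base + (1 if t < extra else 0)
        # Clamp to the stream length so the last block may be partial.
        start = min(start_block * align, n_elements)
        end = min((start_block + blocks) * align, n_elements)
        chunks.append((start, end))
        start_block += blocks
    return chunks


if __name__ == "__main__":
    for thread_id, (start, end) in enumerate(balanced_chunks(1000, 4, align=32)):
        print(thread_id, start, end, end - start)
```

With 1000 elements, 4 threads, and an alignment of 32, each thread receives 8 aligned blocks and only the last range is shortened by the end of the stream. A real device would also have to account for how groups of threads map onto the hardware memory interface, which this sketch leaves out.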
Source: USPTO / EPO open patent data. Objective bibliographic and citation counts.