Abstract
Data movement and communication have become the primary bottlenecks in large multicore systems. The near-data computing paradigm provides a solution: move computation to where the data resides on-chip. Two challenges keep near-data computing from the mainstream: lack of programmer transparency and applicability. Programmer transparency requires providing sequential memory semantics with distributed computation, which requires burdensome coordination. Broad applicability requires support for combinations of address patterns (e.g. affine, indirect, multi-operand) and computation types (loads, stores, reductions, atomics).We find that streams - coarse grain memory access patterns - are a powerful ISA abstraction for near data offloading. Tracking data access at stream-granularity heavily reduces the burden of coordination for providing sequential semantics. Decomposing the problem using streams means that arbitrary combinations of address and computation patterns can be combined for broad generality.With this insight, we develop a paradigm called near-stream computing, comprising a compiler, CPU ISA extension, and a microarchitecture that facilitate programmer transparent computation offloading to shared caches. We evaluate our system on OpenMP kernels that stress broad addressing and compute behavior, and find that 46% of dynamic instructions can be offloaded to remote banks, reducing the network traffic by 76%. Overall it achieves 2.13× speedup over a state-of-the-art near-data computing technique, with a 1.90× energy efficiency gain.
Original language | English (US) |
---|---|
Title of host publication | Proceedings - 2022 IEEE International Symposium on High-Performance Computer Architecture, HPCA 2022 |
Publisher | IEEE Computer Society |
Pages | 331-345 |
Number of pages | 15 |
ISBN (Electronic) | 9781665420273 |
DOIs | |
State | Published - 2022 |
Event | 28th Annual IEEE International Symposium on High-Performance Computer Architecture, HPCA 2022 - Virtual, Online, Korea, Republic of Duration: Apr 2 2022 → Apr 6 2022 |
Publication series
Name | Proceedings - International Symposium on High-Performance Computer Architecture |
---|---|
Volume | 2022-April |
ISSN (Print) | 1530-0897 |
Conference
Conference | 28th Annual IEEE International Symposium on High-Performance Computer Architecture, HPCA 2022 |
---|---|
Country/Territory | Korea, Republic of |
City | Virtual, Online |
Period | 04/2/22 → 04/6/22 |
Bibliographical note
Publisher Copyright:© 2022 IEEE.
Keywords
- Near-Data Computing
- Programmer-Transparent Acceleration
- Stream-Based ISAs
ASJC Scopus subject areas
- Hardware and Architecture