Abstract
Dense linear algebra kernels are critical for wireless, and the oncoming proliferation of 5G only amplifies their importance. Due to the inductive nature of many such algorithms, parallelism is difficult to exploit: parallel regions have fine-grain producer/consumer interaction with iteratively changing depen-dence distance, reuse rate, and memory access patterns. This makes multi-threading impractical due to fine-grain synchronization, and vectorization ineffective due to the non-rectangular iteration domain. CPUs, DSPs, and GPUs perform order-of-magnitude below peak. Our insight is that if the nature of inductive dependences and memory accesses were explicit in the hardware/software interface, then a spatial architecture could efficiently execute parallel code regions. To this end, we first develop a novel execution model, inductive dataflow, where inductive dependence patterns and memory access patterns (streams) are first-order primitives. Second, we develop a hybrid spatial architecture combining systolic and tagged dataflow execution to attain high utilization at low energy and area cost. Finally, we create a scalable design through a novel vector-stream control model which amortizes control overhead both in time and spatially across architecture lanes. We evaluate our design, REVEL, with a full stack (compiler, ISA, simulator, RTL). Across a suite of linear algebra kernels, REVEL outperforms equally-provisioned DSPs by 4.6×-37×. Compared to state-of-the-art spatial architectures, REVEL is mean 3× faster. Compared to a set of ASICs, REVEL is only 2× the power and half the area.
Original language | English (US) |
---|---|
Title of host publication | Proceedings - 2020 IEEE International Symposium on High Performance Computer Architecture, HPCA 2020 |
Publisher | Institute of Electrical and Electronics Engineers Inc. |
Pages | 703-716 |
Number of pages | 14 |
ISBN (Electronic) | 9781728161495 |
DOIs | |
State | Published - Feb 2020 |
Event | 26th IEEE International Symposium on High Performance Computer Architecture, HPCA 2020 - San Diego, United States Duration: Feb 22 2020 → Feb 26 2020 |
Publication series
Name | Proceedings - 2020 IEEE International Symposium on High Performance Computer Architecture, HPCA 2020 |
---|
Conference
Conference | 26th IEEE International Symposium on High Performance Computer Architecture, HPCA 2020 |
---|---|
Country/Territory | United States |
City | San Diego |
Period | 02/22/20 → 02/26/20 |
Bibliographical note
Publisher Copyright:© 2020 IEEE.
Keywords
- Digital Signal Processor
- Reconfigurable Accelerator
- Software/Hardware Codesign
- Spatial Architecture
ASJC Scopus subject areas
- Artificial Intelligence
- Hardware and Architecture
- Safety, Risk, Reliability and Quality