Abstract
FPGAs have been proven to be powerful computational accelerators across many types of workloads. The mainstream programming approach is high level synthesis (HLS), which maps high-level languages (e.g. C+ #pragmas) to hardware. Unfortunately, HLS leaves a significant programmability gap in terms of reconfigurability, customization and versatility: Although HLS compilation is fast, the downstream physical design takes hours to days; FPGA reconfiguration time limits the time-multiplexing ability of hardware, and tools do not reason about cross-workload flexibility. Overlay architectures mitigate the above by mapping a programmable design (e.g. CPU, GPU, etc.) on top of FPGAs. However, the abstraction gap between overlay and FPGA leads to low efficiency/utilization. Our essential idea is to develop a hardware generation framework targeting a highly-customizable overlay, so that the abstraction gap can be lowered by tuning the design instance to applications of interest. We leverage and extend prior work on customizable spatial architectures, SoC generation, accelerator compilers, and design space explorers to create an end-to-end FPGA acceleration system. Our novel techniques address inefficient networks between on-chip memories and processing elements, as well as improving DSE by reducing the amount of recompilation required. Our framework, OverGen, is highly competitive with fixed-function HLS-based designs, even though the generated designs are programmable with fast reconfiguration. We compared to a state-of-the-art DSE-based HLS framework, AutoDSE. Without kernel-tuning for AutoDSE, OverGen gets 1.2 × geomean performance, and even with manual kernel-tuning for the baseline, OverGen still gets 0.55 × geomean performance - all while providing runtime flexibility across workloads.
Original language | English (US) |
---|---|
Title of host publication | Proceedings - 2022 55th Annual IEEE/ACM International Symposium on Microarchitecture, MICRO 2022 |
Publisher | IEEE Computer Society |
Pages | 35-56 |
Number of pages | 22 |
ISBN (Electronic) | 9781665462723 |
DOIs | |
State | Published - 2022 |
Event | 55th Annual IEEE/ACM International Symposium on Microarchitecture, MICRO 2022 - Chicago, United States Duration: Oct 1 2022 → Oct 5 2022 |
Publication series
Name | Proceedings of the Annual International Symposium on Microarchitecture, MICRO |
---|---|
Volume | 2022-October |
ISSN (Print) | 1072-4451 |
Conference
Conference | 55th Annual IEEE/ACM International Symposium on Microarchitecture, MICRO 2022 |
---|---|
Country/Territory | United States |
City | Chicago |
Period | 10/1/22 → 10/5/22 |
Bibliographical note
Publisher Copyright:© 2022 IEEE.
Keywords
- CGRA
- Design Automation
- Domain-specific Accelerators
- FPGA
- Reconfigurable architectures
ASJC Scopus subject areas
- Hardware and Architecture