Abstract
Convolutional neural networks(CNNs) have demonstrated remarkable capability and scalability in a variety of vision-related tasks. Due to privacy and latency constraints, in some scenarios, the CNNs are deployed on-site where power supply, computation power, and memory capacity are limited. These constraints hinder the traditional training or modification of CNN models, which typically involves network-scale backpropagation of the gradients. In this work, we proposed a framework enabling the derivation of lightweight models from the original model at the edge only utilizing hardware-friendly operations. In the proposed framework, all models are binary quantized and the gradients are obtained by layer-wise decision boundary matching. Hence, the whole flow can be executed with bit-wise and fixed-point arithmetic operations without network-scale gradient backpropagations. The derived model serves as a viable alternative to the original, in scenarios where the accuracy requirements are less stringent, delivering enhanced efficiencies in power and memory consumption. We validate the framework on digit recognition tasks, obtaining a better accuracy than naively deploying the same lightweight model. Furthermore, an FPGA demonstration of our framework achieved a throughput of 2.2 TOPS/s, underscoring its practical applicability.
Original language | English (US) |
---|---|
Title of host publication | 2024 IEEE 6th International Conference on AI Circuits and Systems, AICAS 2024 - Proceedings |
Publisher | Institute of Electrical and Electronics Engineers Inc. |
Pages | 11-15 |
Number of pages | 5 |
ISBN (Electronic) | 9798350383638 |
DOIs | |
State | Published - 2024 |
Event | 6th IEEE International Conference on AI Circuits and Systems, AICAS 2024 - Abu Dhabi, United Arab Emirates Duration: Apr 22 2024 → Apr 25 2024 |
Publication series
Name | 2024 IEEE 6th International Conference on AI Circuits and Systems, AICAS 2024 - Proceedings |
---|
Conference
Conference | 6th IEEE International Conference on AI Circuits and Systems, AICAS 2024 |
---|---|
Country/Territory | United Arab Emirates |
City | Abu Dhabi |
Period | 04/22/24 → 04/25/24 |
Bibliographical note
Publisher Copyright:© 2024 IEEE.
Keywords
- binary neural network(BNN)
- edge computing
- FPGA accelerator
ASJC Scopus subject areas
- Artificial Intelligence
- Computer Vision and Pattern Recognition
- Hardware and Architecture
- Electrical and Electronic Engineering
- Instrumentation