Hardware-Friendly Lightweight Convolutional Neural Network Derivation at The Edge

Li Zhang*, Ahmed M. Eltawil, Khaled N. Salama

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Convolutional neural networks(CNNs) have demonstrated remarkable capability and scalability in a variety of vision-related tasks. Due to privacy and latency constraints, in some scenarios, the CNNs are deployed on-site where power supply, computation power, and memory capacity are limited. These constraints hinder the traditional training or modification of CNN models, which typically involves network-scale backpropagation of the gradients. In this work, we proposed a framework enabling the derivation of lightweight models from the original model at the edge only utilizing hardware-friendly operations. In the proposed framework, all models are binary quantized and the gradients are obtained by layer-wise decision boundary matching. Hence, the whole flow can be executed with bit-wise and fixed-point arithmetic operations without network-scale gradient backpropagations. The derived model serves as a viable alternative to the original, in scenarios where the accuracy requirements are less stringent, delivering enhanced efficiencies in power and memory consumption. We validate the framework on digit recognition tasks, obtaining a better accuracy than naively deploying the same lightweight model. Furthermore, an FPGA demonstration of our framework achieved a throughput of 2.2 TOPS/s, underscoring its practical applicability.

Original languageEnglish (US)
Title of host publication2024 IEEE 6th International Conference on AI Circuits and Systems, AICAS 2024 - Proceedings
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages11-15
Number of pages5
ISBN (Electronic)9798350383638
DOIs
StatePublished - 2024
Event6th IEEE International Conference on AI Circuits and Systems, AICAS 2024 - Abu Dhabi, United Arab Emirates
Duration: Apr 22 2024Apr 25 2024

Publication series

Name2024 IEEE 6th International Conference on AI Circuits and Systems, AICAS 2024 - Proceedings

Conference

Conference6th IEEE International Conference on AI Circuits and Systems, AICAS 2024
Country/TerritoryUnited Arab Emirates
CityAbu Dhabi
Period04/22/2404/25/24

Bibliographical note

Publisher Copyright:
© 2024 IEEE.

Keywords

  • binary neural network(BNN)
  • edge computing
  • FPGA accelerator

ASJC Scopus subject areas

  • Artificial Intelligence
  • Computer Vision and Pattern Recognition
  • Hardware and Architecture
  • Electrical and Electronic Engineering
  • Instrumentation

Fingerprint

Dive into the research topics of 'Hardware-Friendly Lightweight Convolutional Neural Network Derivation at The Edge'. Together they form a unique fingerprint.

Cite this