Design and Optimization of Residual Neural Network Accelerators for Low-Power FPGAs Using High-Level Synthesis
Filippo Minnella, Teodoro Urso, Mihai T. Lazarescu, Luciano Lavagno

TL;DR
This paper introduces a high-level synthesis design flow for efficient FPGA implementation of residual neural networks, significantly improving speed, accuracy, and energy efficiency on resource-constrained devices.
Contribution
It presents a novel FPGA design methodology that reduces buffering overhead in ResNets, enabling high-performance, resource-efficient deep learning accelerators using HLS.
Findings
ResNet20 achieves 2.88x speedup with 0.5% higher accuracy.
ResNet8 accuracy improves by 2.8%.
Throughputs reach over 12,000 FPS on Ultra96 and 30,000 FPS on Kria KV260.
Abstract
Residual neural networks are widely used in computer vision tasks. They enable the construction of deeper and more accurate models by mitigating the vanishing gradient problem. Their main innovation is the residual block which allows the output of one layer to bypass one or more intermediate layers and be added to the output of a later layer. Their complex structure and the buffering required by the residual block make them difficult to implement on resource-constrained platforms. We present a novel design flow for implementing deep learning models for field programmable gate arrays optimized for ResNets, using a strategy to reduce their buffering overhead to obtain a resource-efficient implementation of the residual layer. Our high-level synthesis (HLS)-based flow encompasses a thorough set of design principles and optimization strategies, exploiting in novel ways standard techniques…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsIntegrated Circuits and Semiconductor Failure Analysis · Advanced Neural Network Applications · VLSI and Analog Circuit Testing
