ZynqNet: An FPGA-Accelerated Embedded Convolutional Neural Network
David Gschwend

TL;DR
This paper presents ZynqNet, an FPGA-accelerated CNN optimized for embedded image classification, achieving high accuracy with low computational complexity on a Zynq System-on-Chip.
Contribution
It introduces a novel, highly efficient CNN topology and an FPGA-based accelerator optimized for embedded systems with tight constraints.
Findings
Achieves 84.5% top-5 accuracy on ImageNet
Requires only 530 million multiply-accumulate operations
Accelerator operates at 200MHz with high device utilization
Abstract
Image Understanding is becoming a vital feature in ever more applications ranging from medical diagnostics to autonomous vehicles. Many applications demand for embedded solutions that integrate into existing systems with tight real-time and power constraints. Convolutional Neural Networks (CNNs) presently achieve record-breaking accuracies in all image understanding benchmarks, but have a very high computational complexity. Embedded CNNs thus call for small and efficient, yet very powerful computing platforms. This master thesis explores the potential of FPGA-based CNN acceleration and demonstrates a fully functional proof-of-concept CNN implementation on a Zynq System-on-Chip. The ZynqNet Embedded CNN is designed for image classification on ImageNet and consists of ZynqNet CNN, an optimized and customized CNN topology, and the ZynqNet FPGA Accelerator, an FPGA-based architecture for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · CCD and CMOS Imaging Sensors · Brain Tumor Detection and Classification
Methods*Communicated@Fast*How Do I Communicate to Expedia?
