CoDeNet: Efficient Deployment of Input-Adaptive Object Detection on Embedded FPGAs
Zhen Dong, Dequan Wang, Qijing Huang, Yizhao Gao, Yaohui Cai, Tian Li,, Bichen Wu, Kurt Keutzer, John Wawrzynek

TL;DR
This paper presents CoDeNet, an FPGA-based object detection pipeline that efficiently integrates deformable convolutions, achieving high speed and accuracy with a tiny model size suitable for embedded systems.
Contribution
It introduces a novel FPGA-optimized object detection network with deformable convolutions, including quantization and design tradeoffs for improved efficiency and accuracy.
Findings
Achieves 26.9 FPS with 0.76 MB model size and 61.7 AP50 on Pascal VOC.
Attains 67.1 AP50 with 2.9 MB parameters, outperforming Tiny-YOLO.
Demonstrates effective FPGA deployment of input-adaptive object detection models.
Abstract
Deploying deep learning models on embedded systems has been challenging due to limited computing resources. The majority of existing work focuses on accelerating image classification, while other fundamental vision problems, such as object detection, have not been adequately addressed. Compared with image classification, detection problems are more sensitive to the spatial variance of objects, and therefore, require specialized convolutions to aggregate spatial information. To address this need, recent work introduces dynamic deformable convolution to augment regular convolutions. However, this will lead to inefficient memory accesses of inputs with existing hardware. In this work, we harness the flexibility of FPGAs to develop a novel object detection pipeline with deformable convolutions. We show the speed-accuracy tradeoffs for a set of algorithm modifications including…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · CCD and CMOS Imaging Sensors · Advanced Image and Video Retrieval Techniques
MethodsConvolution · Deformable Convolution
