CoDeNet: Efficient Deployment of Input-Adaptive Object Detection on   Embedded FPGAs

Zhen Dong; Dequan Wang; Qijing Huang; Yizhao Gao; Yaohui Cai; Tian Li,; Bichen Wu; Kurt Keutzer; John Wawrzynek

arXiv:2006.08357·cs.CV·January 27, 2021·1 cites

CoDeNet: Efficient Deployment of Input-Adaptive Object Detection on Embedded FPGAs

Zhen Dong, Dequan Wang, Qijing Huang, Yizhao Gao, Yaohui Cai, Tian Li,, Bichen Wu, Kurt Keutzer, John Wawrzynek

PDF

Open Access 2 Repos

TL;DR

This paper presents CoDeNet, an FPGA-based object detection pipeline that efficiently integrates deformable convolutions, achieving high speed and accuracy with a tiny model size suitable for embedded systems.

Contribution

It introduces a novel FPGA-optimized object detection network with deformable convolutions, including quantization and design tradeoffs for improved efficiency and accuracy.

Findings

01

Achieves 26.9 FPS with 0.76 MB model size and 61.7 AP50 on Pascal VOC.

02

Attains 67.1 AP50 with 2.9 MB parameters, outperforming Tiny-YOLO.

03

Demonstrates effective FPGA deployment of input-adaptive object detection models.

Abstract

Deploying deep learning models on embedded systems has been challenging due to limited computing resources. The majority of existing work focuses on accelerating image classification, while other fundamental vision problems, such as object detection, have not been adequately addressed. Compared with image classification, detection problems are more sensitive to the spatial variance of objects, and therefore, require specialized convolutions to aggregate spatial information. To address this need, recent work introduces dynamic deformable convolution to augment regular convolutions. However, this will lead to inefficient memory accesses of inputs with existing hardware. In this work, we harness the flexibility of FPGAs to develop a novel object detection pipeline with deformable convolutions. We show the speed-accuracy tradeoffs for a set of algorithm modifications including…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · CCD and CMOS Imaging Sensors · Advanced Image and Video Retrieval Techniques

MethodsConvolution · Deformable Convolution