TL;DR
This paper develops real-time object detectors optimized for embedded AI edge platforms, balancing detection accuracy and inference speed using shallow CNN backbones and various architectures.
Contribution
It introduces a real-time object detection model tailored for embedded systems, utilizing shallow CNN backbones and intermediate layer optimization for balanced performance.
Findings
Achieved real-time detection on NVIDIA Drive PX2 and Jetson Xavier.
Demonstrated balanced accuracy and speed across multiple backbone architectures.
Provided publicly available code and models for embedded object detection.
Abstract
With the improvements in the object detection networks, several variations of object detection networks have been achieved impressive performance. However, the performance evaluation of most models has focused on detection accuracy, and performance verification is mostly based on high-end GPU hardware. In this paper, we propose real-time object detectors that guarantee balanced performance for real-time systems on embedded platforms. The proposed model utilizes the basic head structure of the RefineDet model, which is a variant of the single-shot object detector (SSD). In order to ensure real-time performance, CNN models with relatively shallow layers or fewer parameters have been used as the backbone structure. In addition to the basic VGGNet and ResNet structures, various backbone structures such as MobileNet, Xception, ResNeXt, Inception-SENet, and SE-ResNeXt have been used for this…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings · Depthwise Convolution · Pointwise Convolution · Average Pooling · ResNeXt Block · Grouped Convolution · ResNeXt · Depthwise Separable Convolution · Softmax · Dense Connections
