ShuffleDet: Real-Time Vehicle Detection Network in On-board Embedded UAV Imagery
Seyed Majid Azimi

TL;DR
ShuffleDet is a fast, lightweight vehicle detection network designed for UAVs that uses channel shuffling and grouped convolutions to achieve real-time performance with competitive accuracy.
Contribution
The paper introduces ShuffleDet, a novel vehicle detection network optimized for embedded UAV platforms, combining channel shuffling, grouped convolutions, and specialized modules for improved speed and shape consideration.
Findings
Achieves 14 FPS on NVIDIA Jetson TX2.
Uses only 3.8 GFLOPs for real-time detection.
Performs competitively on CARPK and PUCPR+ datasets.
Abstract
On-board real-time vehicle detection is of great significance for UAVs and other embedded mobile platforms. We propose a computationally inexpensive detection network for vehicle detection in UAV imagery which we call ShuffleDet. In order to enhance the speed-wise performance, we construct our method primarily using channel shuffling and grouped convolutions. We apply inception modules and deformable modules to consider the size and geometric shape of the vehicles. ShuffleDet is evaluated on CARPK and PUCPR+ datasets and compared against the state-of-the-art real-time object detection networks. ShuffleDet achieves 3.8 GFLOPs while it provides competitive performance on test sets of both datasets. We show that our algorithm achieves real-time performance by running at the speed of 14 frames per second on NVIDIA Jetson TX2 showing high potential for this method for real-time processing in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Video Surveillance and Tracking Methods · Advanced Image and Video Retrieval Techniques
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
