TL;DR
This paper introduces SlimYOLOv3, a pruned, efficient version of YOLOv3 designed for real-time UAV applications, achieving significant reductions in computational cost while maintaining comparable accuracy.
Contribution
The paper proposes a channel pruning method to create SlimYOLOv3, a lightweight, faster, and more efficient object detector suitable for embedded UAV systems.
Findings
FLOPs reduced by ~90.8%
Parameter size decreased by ~92.0%
Runs approximately twice as fast as YOLOv3
Abstract
Drones or general Unmanned Aerial Vehicles (UAVs), endowed with computer vision function by on-board cameras and embedded systems, have become popular in a wide range of applications. However, real-time scene parsing through object detection running on a UAV platform is very challenging, due to limited memory and computing power of embedded devices. To deal with these challenges, in this paper we propose to learn efficient deep object detectors through channel pruning of convolutional layers. To this end, we enforce channel-level sparsity of convolutional layers by imposing L1 regularization on channel scaling factors and prune less informative feature channels to obtain "slim" object detectors. Based on such approach, we present SlimYOLOv3 with fewer trainable parameters and floating point operations (FLOPs) in comparison of original YOLOv3 (Joseph Redmon et al., 2018) as a promising…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsPruning · Average Pooling · Logistic Regression · Global Average Pooling · 1x1 Convolution · Batch Normalization · k-Means Clustering · Softmax · Residual Connection · Convolution
