Analysis and Adaptation of YOLOv4 for Object Detection in Aerial Images
Aryaman Singh Samyal, Akshatha K R, Soham Hans, Karunakar A K, Satish, Shenoy B

TL;DR
This paper adapts YOLOv4 for aerial image object detection, achieving high accuracy and speed suitable for UAVs, with transfer learning and extensive evaluation demonstrating its effectiveness over other detectors.
Contribution
It presents a tailored adaptation of YOLOv4 for aerial images, optimizing for accuracy and efficiency, and provides a comprehensive comparative analysis with other detectors.
Findings
Achieved 45.64% mAP on VisDrone dataset
Inference speed of 8.7 FPS on Tesla K80 GPU
YOLOv4 outperforms several contemporary aerial detectors
Abstract
The recent and rapid growth in Unmanned Aerial Vehicles (UAVs) deployment for various computer vision tasks has paved the path for numerous opportunities to make them more effective and valuable. Object detection in aerial images is challenging due to variations in appearance, pose, and scale. Autonomous aerial flight systems with their inherited limited memory and computational power demand accurate and computationally efficient detection algorithms for real-time applications. Our work shows the adaptation of the popular YOLOv4 framework for predicting the objects and their locations in aerial images with high accuracy and inference speed. We utilized transfer learning for faster convergence of the model on the VisDrone DET aerial object detection dataset. The trained model resulted in a mean average precision (mAP) of 45.64% with an inference speed reaching 8.7 FPS on the Tesla K80…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Robotics and Sensor-Based Localization · Advanced Image and Video Retrieval Techniques
Methods*Communicated@Fast*How Do I Communicate to Expedia? · SPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings · Feature Pyramid Network · Global Average Pooling · Residual Connection · Batch Normalization · Softmax · 1x1 Convolution · BNB Customer Service Number +1-833-534-1729 · Logistic Regression
