Evaluation of YOLO Models with Sliced Inference for Small Object Detection
Muhammed Can Keles, Batuhan Salmanoglu, Mehmet Serdar Guzel, Baran, Gursoy, Gazi Erkan Bostanci

TL;DR
This paper benchmarks YOLOv5 and YOLOX models for small object detection using the VisDrone2019Det dataset, demonstrating that sliced inference and fine-tuning significantly improve detection accuracy, especially for YOLOv5 models.
Contribution
It introduces the use of sliced inference and fine-tuning for YOLO models to enhance small object detection performance.
Findings
Sliced inference increases AP50 scores across models.
YOLOv5 models benefit more from sliced inference than YOLOX.
Combining sliced fine-tuning and inference yields substantial accuracy improvements.
Abstract
Small object detection has major applications in the fields of UAVs, surveillance, farming and many others. In this work we investigate the performance of state of the art Yolo based object detection models for the task of small object detection as they are one of the most popular and easy to use object detection models. We evaluated YOLOv5 and YOLOX models in this study. We also investigate the effects of slicing aided inference and fine-tuning the model for slicing aided inference. We used the VisDrone2019Det dataset for training and evaluating our models. This dataset is challenging in the sense that most objects are relatively small compared to the image sizes. This work aims to benchmark the YOLOv5 and YOLOX models for small object detection. We have seen that sliced inference increases the AP50 score in all experiments, this effect was greater for the YOLOv5 models compared to the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Robotics and Sensor-Based Localization · Advanced Image and Video Retrieval Techniques
MethodsBNB Customer Service Number +1-833-534-1729 · You Only Look Once · Average Pooling · 1x1 Convolution · Batch Normalization · Softmax · Convolution · Global Average Pooling · Residual Connection · CSPDarknet53
