Adaptive Image Zoom-in with Bounding Box Transformation for UAV Object Detection
Tao Wang, Chenyu Lin, Chenwei Tang, Jizhe Zhou, Deng Xiong, Jianan Li, Jian Zhao, Jiancheng Lv

TL;DR
This paper introduces an adaptive zoom-in framework for UAV object detection that improves detection accuracy by focusing on smaller, sparser objects through learned non-uniform zooming and bounding box transformations, applicable across various detection architectures.
Contribution
The work proposes a novel, lightweight adaptive zooming method with a box-based transformation, enhancing UAV object detection performance without significant computational overhead.
Findings
Over 8.4% mAP improvement on SeaDronesSee dataset with Faster R-CNN.
The method is architecture-independent and applicable to multiple detection models.
Achieves high accuracy with minimal latency increase (~3 ms).
Abstract
Detecting objects from UAV-captured images is challenging due to the small object size. In this work, a simple and efficient adaptive zoom-in framework is explored for object detection on UAV images. The main motivation is that the foreground objects are generally smaller and sparser than those in common scene images, which hinders the optimization of effective object detectors. We thus aim to zoom in adaptively on the objects to better capture object features for the detection task. To achieve the goal, two core designs are required: \textcolor{black}{i) How to conduct non-uniform zooming on each image efficiently? ii) How to enable object detection training and inference with the zoomed image space?} Correspondingly, a lightweight offset prediction scheme coupled with a novel box-based zooming objective is introduced to learn non-uniform zooming on the input image. Based on the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · UAV Applications and Optimization
