Boundary and Position Information Mining for Aerial Small Object Detection
Rongxin Huang, Guangfeng Lin, Wenbo Zhou, Zhirong Li, Wenhuan Wu

TL;DR
This paper introduces BPIM, a novel framework that enhances small object detection in aerial images by integrating boundary, position, and scale information through attention mechanisms and multi-scale feature fusion.
Contribution
The paper proposes a comprehensive boundary and position information mining framework that improves small object detection accuracy in UAV imagery, outperforming baseline models and achieving state-of-the-art results.
Findings
BPIM outperforms baseline Yolov5-P2 on multiple datasets.
BPIM achieves state-of-the-art detection performance with comparable computation.
Boundary and position information integration enhances small object perception.
Abstract
Unmanned Aerial Vehicle (UAV) applications have become increasingly prevalent in aerial photography and object recognition. However, there are major challenges to accurately capturing small targets in object detection due to the imbalanced scale and the blurred edges. To address these issues, boundary and position information mining (BPIM) framework is proposed for capturing object edge and location cues. The proposed BPIM includes position information guidance (PIG) module for obtaining location information, boundary information guidance (BIG) module for extracting object edge, cross scale fusion (CSF) module for gradually assembling the shallow layer image feature, three feature fusion (TFF) module for progressively combining position and boundary information, and adaptive weight fusion (AWF) module for flexibly merging the deep layer semantic feature. Therefore, BPIM can integrate…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Advanced Image and Video Retrieval Techniques · Robotics and Sensor-Based Localization
