Enhancing Satellite Object Localization with Dilated Convolutions and Attention-aided Spatial Pooling
Seraj Al Mahmud Mostafa, Chenxi Wang, Jia Yue, Yuta Hozumi, Jianwu Wang

TL;DR
This paper introduces YOLO-DCAP, an improved satellite object localization model that uses multi-scale dilated residual convolutions and attention-aided spatial pooling, significantly outperforming existing methods across multiple datasets.
Contribution
The paper proposes YOLO-DCAP, a novel model combining MDRC and AaSP modules to enhance satellite object localization, demonstrating superior performance over existing models.
Findings
20.95% average mAP50 improvement over YOLO base model
32.23% average IoU improvement over YOLO base model
Consistent performance gains across three satellite datasets
Abstract
Object localization in satellite imagery is particularly challenging due to the high variability of objects, low spatial resolution, and interference from noise and dominant features such as clouds and city lights. In this research, we focus on three satellite datasets: upper atmospheric Gravity Waves (GW), mesospheric Bores (Bore), and Ocean Eddies (OE), each presenting its own unique challenges. These challenges include the variability in the scale and appearance of the main object patterns, where the size, shape, and feature extent of objects of interest can differ significantly. To address these challenges, we introduce YOLO-DCAP, a novel enhanced version of YOLOv5 designed to improve object localization in these complex scenarios. YOLO-DCAP incorporates a Multi-scale Dilated Residual Convolution (MDRC) block to capture multi-scale features at scale with varying dilation rates, and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Remote-Sensing Image Classification · Image Enhancement Techniques
MethodsFocus · Balanced Selection · Gravity · Convolution
