A Multi-Scale Attention-Enhanced Architecture for Gravity Wave Localization in Satellite Imagery
Seraj Al Mahmud Mostafa, Jianwu Wang

TL;DR
This paper introduces YOLO-DCAT, an advanced neural network architecture that enhances gravity wave detection in satellite images by effectively capturing multi-scale features and focusing on relevant regions, significantly improving localization accuracy.
Contribution
The paper presents a novel YOLO-DCAT model with Multi Dilated Residual Convolution and Simplified Spatial and Channel Attention, tailored for complex gravity wave localization in satellite imagery.
Findings
Over 14% increase in mean Average Precision (mAP)
Approximately 17% improvement in Intersection over Union (IoU)
Significant enhancement in localization accuracy for gravity waves
Abstract
Satellite images present unique challenges due to their high object variability and lower spatial resolution, particularly for detecting atmospheric gravity waves which exhibit significant variability in scale, shape, and pattern extent, making accurate localization highly challenging. This variability is further compounded by dominant unwanted objects such as clouds and city lights, as well as instrumental noise, all within a single image channel, while conventional detection methods struggle to capture the diverse and often subtle features of gravity waves across varying conditions. To address these issues, we introduce YOLO-DCAT incorporating Multi Dilated Residual Convolution (MDRC) and Simplified Spatial and Channel Attention (SSCA), an enhanced version of YOLOv5 specifically designed to improve gravity wave localization by effectively handling their complex and variable…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
