AMMUNet: Multi-Scale Attention Map Merging for Remote Sensing Image Segmentation
Yang Yang, Shunyi Zheng

TL;DR
AMMUNet introduces a multi-scale attention merging framework with efficient global information capture, significantly improving remote sensing image segmentation accuracy while reducing computational costs.
Contribution
It proposes GMSA and AMMM modules for efficient multi-scale attention merging, enhancing global modeling in remote sensing segmentation.
Findings
Achieved 75.48% mIoU on Vaihingen dataset.
Achieved 77.90% mIoU on Potsdam dataset.
Outperformed existing methods in segmentation accuracy.
Abstract
The advancement of deep learning has driven notable progress in remote sensing semantic segmentation. Attention mechanisms, while enabling global modeling and utilizing contextual information, face challenges of high computational costs and require window-based operations that weaken capturing long-range dependencies, hindering their effectiveness for remote sensing image processing. In this letter, we propose AMMUNet, a UNet-based framework that employs multi-scale attention map merging, comprising two key innovations: the granular multi-head self-attention (GMSA) module and the attention map merging mechanism (AMMM). GMSA efficiently acquires global information while substantially mitigating computational costs in contrast to global multi-head self-attention mechanism. This is accomplished through the strategic utilization of dimension correspondence to align granularity and the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Image Retrieval and Classification Techniques · Remote-Sensing Image Classification
MethodsALIGN
