RS-Mamba for Large Remote Sensing Image Dense Prediction
Sijie Zhao, Hao Chen, Xueliang Zhang, Pengfeng Xiao, Lei Bai, and, Wanli Ouyang

TL;DR
The paper introduces RSM, a linear-complexity model that effectively captures global context in large VHR remote sensing images, improving dense prediction tasks like segmentation and change detection.
Contribution
RSM is specifically designed for large remote sensing images, incorporating omnidirectional context modeling with linear complexity, outperforming transformer-based models.
Findings
RSM achieves state-of-the-art performance on dense prediction tasks.
RSM is more efficient and accurate than transformer models on large images.
Larger image sizes improve RSM's performance.
Abstract
Context modeling is critical for remote sensing image dense prediction tasks. Nowadays, the growing size of very-high-resolution (VHR) remote sensing images poses challenges in effectively modeling context. While transformer-based models possess global modeling capabilities, they encounter computational challenges when applied to large VHR images due to their quadratic complexity. The conventional practice of cropping large images into smaller patches results in a notable loss of contextual information. To address these issues, we propose the Remote Sensing Mamba (RSM) for dense prediction tasks in large VHR remote sensing images. RSM is specifically designed to capture the global context of remote sensing images with linear complexity, facilitating the effective processing of large VHR images. Considering that the land covers in remote sensing images are distributed in arbitrary…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRemote Sensing and Land Use · Satellite Image Processing and Photogrammetry · Remote-Sensing Image Classification
MethodsAttention Is All You Need · Response Surface Methodology · Linear Layer · Layer Normalization · Multi-Head Attention · Adam · Byte Pair Encoding · Absolute Position Encodings · Softmax · Dense Connections
