RS3Mamba: Visual State Space Model for Remote Sensing Images Semantic Segmentation
Xianping Ma, Xiaokang Zhang, and Man-On Pun

TL;DR
This paper introduces RS3Mamba, a dual-branch neural network leveraging the novel visual state space model for improved semantic segmentation of remote sensing images, addressing limitations of CNNs and Transformers.
Contribution
It presents the first vision Mamba tailored for remote sensing segmentation, integrating VSS blocks and a collaborative completion module for enhanced global feature modeling.
Findings
Effective on ISPRS Vaihingen dataset
Outperforms existing methods in accuracy
Demonstrates strong generalization on LoveDA dataset
Abstract
Semantic segmentation of remote sensing images is a fundamental task in geoscience research. However, there are some significant shortcomings for the widely used convolutional neural networks (CNNs) and Transformers. The former is limited by its insufficient long-range modeling capabilities, while the latter is hampered by its computational complexity. Recently, a novel visual state space (VSS) model represented by Mamba has emerged, capable of modeling long-range relationships with linear computability. In this work, we propose a novel dual-branch network named remote sensing images semantic segmentation Mamba (RS3Mamba) to incorporate this innovative technology into remote sensing tasks. Specifically, RS3Mamba utilizes VSS blocks to construct an auxiliary branch, providing additional global information to convolution-based main branch. Moreover, considering the distinct…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Remote-Sensing Image Classification · Image Retrieval and Classification Techniques
