Samba: Semantic Segmentation of Remotely Sensed Images with State Space Model
Qinfeng Zhu, Yuanzhi Cai, Yuan Fang, Yihan Yang, Cheng Chen, Lei Fan,, Anh Nguyen

TL;DR
Samba introduces a novel State Space Model-based framework for semantic segmentation of high-resolution remotely sensed images, outperforming CNN and ViT methods and establishing new benchmarks in the field.
Contribution
This paper is the first to apply State Space Models to semantic segmentation of remote sensing images, demonstrating superior performance over existing CNN and ViT approaches.
Findings
Samba achieves state-of-the-art results on LoveDA, ISPRS Vaihingen, and Potsdam datasets.
Samba outperforms traditional CNN and ViT methods in accuracy and efficiency.
The approach sets new performance benchmarks for remote sensing image segmentation.
Abstract
High-resolution remotely sensed images pose a challenge for commonly used semantic segmentation methods such as Convolutional Neural Network (CNN) and Vision Transformer (ViT). CNN-based methods struggle with handling such high-resolution images due to their limited receptive field, while ViT faces challenges in handling long sequences. Inspired by Mamba, which adopts a State Space Model (SSM) to efficiently capture global semantic information, we propose a semantic segmentation framework for high-resolution remotely sensed images, named Samba. Samba utilizes an encoder-decoder architecture, with Samba blocks serving as the encoder for efficient multi-level semantic information extraction, and UperNet functioning as the decoder. We evaluate Samba on the LoveDA, ISPRS Vaihingen, and ISPRS Potsdam datasets, comparing its performance against top-performing CNN and ViT methods. The results…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification · Remote-Sensing Image Classification · Neural Networks and Applications
MethodsAttention Is All You Need · Linear Layer · Layer Normalization · Multi-Head Attention · Adam · Byte Pair Encoding · Absolute Position Encodings · Softmax · Dense Connections · Label Smoothing
