SAM-Mamba: Mamba Guided SAM Architecture for Generalized Zero-Shot Polyp Segmentation
Tapas Kumar Dutta, Snehashis Majhi, Deepak Ranjan Nayak, and Debesh, Jha

TL;DR
SAM-Mamba enhances polyp segmentation by integrating a domain prior into the SAM architecture, improving accuracy and generalization across datasets, and addressing limitations of CNN and ViT models in capturing local and global context.
Contribution
The paper introduces SAM-Mamba, a novel architecture that combines a Mamba-Prior module with SAM for improved zero-shot polyp segmentation.
Findings
Outperforms CNN, ViT, and Adapter-based models on five datasets.
Demonstrates strong zero-shot generalization to unseen datasets.
Suitable for real-time clinical applications.
Abstract
Polyp segmentation in colonoscopy is crucial for detecting colorectal cancer. However, it is challenging due to variations in the structure, color, and size of polyps, as well as the lack of clear boundaries with surrounding tissues. Traditional segmentation models based on Convolutional Neural Networks (CNNs) struggle to capture detailed patterns and global context, limiting their performance. Vision Transformer (ViT)-based models address some of these issues but have difficulties in capturing local context and lack strong zero-shot generalization. To this end, we propose the Mamba-guided Segment Anything Model (SAM-Mamba) for efficient polyp segmentation. Our approach introduces a Mamba-Prior module in the encoder to bridge the gap between the general pre-trained representation of SAM and polyp-relevant trivial clues. It injects salient cues of polyp images into the SAM image encoder…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMetallurgy and Material Forming · Advancements in Photolithography Techniques · Image and Object Detection Techniques
MethodsAttention Is All You Need · Adam · Dropout · Position-Wise Feed-Forward Layer · Softmax · Dense Connections · Byte Pair Encoding · Linear Layer · Multi-Head Attention · Label Smoothing
