SAM Fails to Segment Anything? -- SAM-Adapter: Adapting SAM in Underperformed Scenes: Camouflage, Shadow, Medical Image Segmentation, and More
Tianrun Chen, Lanyun Zhu, Chaotao Ding, Runlong Cao, Yan Wang, Zejian, Li, Lingyun Sun, Papa Mao, Ying Zang

TL;DR
This paper introduces SAM-Adapter, a method to adapt the SAM model for challenging segmentation tasks like camouflage, shadow detection, and medical imaging, significantly improving performance without fine-tuning.
Contribution
We propose SAM-Adapter, a simple yet effective approach to enhance SAM's performance in difficult segmentation tasks by incorporating domain-specific prompts and knowledge.
Findings
SAM-Adapter outperforms existing models in camouflaged object detection.
SAM-Adapter achieves state-of-the-art results in shadow detection.
Medical image segmentation results are improved with SAM-Adapter.
Abstract
The emergence of large models, also known as foundation models, has brought significant advancements to AI research. One such model is Segment Anything (SAM), which is designed for image segmentation tasks. However, as with other foundation models, our experimental findings suggest that SAM may fail or perform poorly in certain segmentation tasks, such as shadow detection and camouflaged object detection (concealed object detection). This study first paves the way for applying the large pre-trained image segmentation model SAM to these downstream tasks, even in situations where SAM performs poorly. Rather than fine-tuning the SAM network, we propose \textbf{SAM-Adapter}, which incorporates domain-specific information or visual prompts into the segmentation network by using simple yet effective adapters. By integrating task-specific knowledge with general knowledge learnt by the large…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVisual Attention and Saliency Detection · Multimodal Machine Learning Applications · Advanced Image and Video Retrieval Techniques
MethodsSegment Anything Model · fail
