Test-Time Adaptation with SaLIP: A Cascade of SAM and CLIP for Zero shot Medical Image Segmentation
Sidra Aleem, Fangyijie Wang, Mayug Maniparambil, Eric Arazo, Julia, Dietlmeier, Guenole Silvestre, Kathleen Curran, Noel E. O'Connor, Suzanne, Little

TL;DR
SaLIP is a novel framework combining SAM and CLIP for zero shot medical image segmentation, achieving significant improvements without training or domain-specific prompts.
Contribution
It introduces a training-free, unified approach that leverages SAM and CLIP for organ segmentation, eliminating the need for labeled data or prompt engineering.
Findings
Substantial DICE score improvements in brain, lung, and fetal head segmentation.
SaLIP outperforms unprompted SAM in zero shot scenarios.
Framework is training and fine-tuning free.
Abstract
The Segment Anything Model (SAM) and CLIP are remarkable vision foundation models (VFMs). SAM, a prompt driven segmentation model, excels in segmentation tasks across diverse domains, while CLIP is renowned for its zero shot recognition capabilities. However, their unified potential has not yet been explored in medical image segmentation. To adapt SAM to medical imaging, existing methods primarily rely on tuning strategies that require extensive data or prior prompts tailored to the specific task, making it particularly challenging when only a limited number of data samples are available. This work presents an in depth exploration of integrating SAM and CLIP into a unified framework for medical image segmentation. Specifically, we propose a simple unified framework, SaLIP, for organ segmentation. Initially, SAM is used for part based segmentation within the image, followed by CLIP to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMedical Imaging and Analysis · Radiomics and Machine Learning in Medical Imaging · AI in cancer detection
MethodsSegment Anything Model · Contrastive Language-Image Pre-training
