Swin-LiteMedSAM: A Lightweight Box-Based Segment Anything Model for Large-Scale Medical Image Datasets
Ruochen Gao, Donghang Lyu, and Marius Staring

TL;DR
This paper introduces Swin-LiteMedSAM, a lightweight medical image segmentation model that balances high performance with computational efficiency, suitable for large-scale medical datasets and real-world applications.
Contribution
It proposes a novel lightweight model using a tiny Swin Transformer, multiple prompt types, and skip connections, improving segmentation speed and accuracy over existing lightweight models.
Findings
Achieved DSC of 0.8678 and NSD of 0.8844 on validation set.
Secured fourth place in the CVPR 2024 challenge.
Demonstrated effective performance across multiple medical imaging modalities.
Abstract
Medical imaging is essential for the diagnosis and treatment of diseases, with medical image segmentation as a subtask receiving high attention. However, automatic medical image segmentation models are typically task-specific and struggle to handle multiple scenarios, such as different imaging modalities and regions of interest. With the introduction of the Segment Anything Model (SAM), training a universal model for various clinical scenarios has become feasible. Recently, several Medical SAM (MedSAM) methods have been proposed, but these models often rely on heavy image encoders to achieve high performance, which may not be practical for real-world applications due to their high computational demands and slow inference speed. To address this issue, a lightweight version of the MedSAM (LiteMedSAM) can provide a viable solution, achieving high performance while requiring fewer resources…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAI in cancer detection · COVID-19 diagnosis using AI · Artificial Intelligence in Healthcare
MethodsByte Pair Encoding · Absolute Position Encodings · Softmax · Label Smoothing · Dropout · Layer Normalization · Attention Is All You Need · Position-Wise Feed-Forward Layer · Linear Layer · Adam
