Are foundation models efficient for medical image segmentation?

Danielle Ferreira; Rima Arnaout

arXiv:2311.04847 · eess.IV · March 11, 2025 · 1 citation

Open Access

TL;DR

This study compares the Segment Anything Model (SAM), a foundation model, with a modality-specific, label-free self-supervised learning (SSL) method for cardiac ultrasound segmentation. SAM performed poorly against clinical ground truth and required more labeling time and compute, making it the less efficient of the two.

Contribution

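The paper directly compares a foundation model (SAM) with a self-supervised learning (SSL) method on medical image segmentation, measuring both performance against clinical ground truth and resource cost (labeling time, compute), and highlights the limitations of the foundation model in this domain.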

Findings

01. SAM performed poorly against clinical ground truth compared to SSL

02. SAM required significantly more labeling time and compute than SSL

03. SSL was more efficient for cardiac ultrasound segmentation

Abstract

Foundation models are experiencing a surge in popularity. The Segment Anything model (SAM) asserts an ability to segment a wide spectrum of objects but required supervised training at unprecedented scale. We compared SAM's performance (against clinical ground truth) and resources (labeling time, compute) to a modality-specific, label-free self-supervised learning (SSL) method on 25 measurements for 100 cardiac ultrasounds. SAM performed poorly and required significantly more labeling and computing resources, demonstrating worse efficiency than SSL.
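
The abstract describes scoring each method's measurements against clinical ground truth. A minimal sketch of one way such a comparison could be set up is shown here. It assumes mean absolute error as the metric, which the abstract does not specify; the function names, method labels, and all numeric values are hypothetical placeholders and are not data or results from the paper.

```python
from statistics import mean


def mean_absolute_error(predicted, reference):
    """Average absolute difference between predicted and reference values."""
    if len(predicted) != len(reference):
        raise ValueError("predicted and reference must have the same length")
    if not predicted:
        raise ValueError("at least one measurement is required")
    return mean(abs(p - r) for p, r in zip(predicted, reference))


def compare_methods(reference, predictions_by_method):
    """Return the mean absolute error of each method against the reference."""
    return {
        name: mean_absolute_error(values, reference)
        for name, values in predictions_by_method.items()
    }


# Placeholder values for illustration only; NOT taken from the paper.
reference = [4.0, 5.0, 6.0]
predictions = {
    "method_a": [4.5, 5.5, 6.5],  # hypothetical method label
    "method_b": [4.0, 5.0, 7.0],  # hypothetical method label
}

errors = compare_methods(reference, predictions)
for name, error in errors.items():
    print(f"{name}: mean absolute error = {error:.3f}")
```

A lower error means closer agreement with the reference. An efficiency comparison like the one the abstract reports would also need the resource side (labeling time, compute), which this sketch leaves out.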

Taxonomy

Topics: Radiomics and Machine Learning in Medical Imaging · AI in cancer detection · Machine Learning in Healthcare

Methods: Segment Anything Model