Prompting with the human-touch: evaluating model-sensitivity of foundation models for musculoskeletal CT segmentation
Caroline Magg, Maaike A. ter Wee, Johannes G.G. Dobbe, Geert J. Streekstra, Leendert Blankevoort, Clara I. S\'anchez, Hoel Kervadec

TL;DR
This study evaluates the performance and sensitivity of 11 promptable foundation models for musculoskeletal CT segmentation across various anatomical regions, highlighting variability in performance and the impact of human prompts on model robustness.
Contribution
The paper provides a comprehensive benchmark of multiple foundation models for medical image segmentation, analyzing their sensitivity to human prompts and identifying Pareto-optimal models for different settings.
Findings
Segmentation performance varies significantly between models and prompting strategies.
Pareto-optimal models identified: SAM and SAM2.1 in 2D, nnInteractive and Med-SAM2 in 3D.
Performance drops when using human prompts compared to ideal prompts.
Abstract
Promptable Foundation Models (FMs), initially introduced for natural image segmentation, have also revolutionized medical image segmentation. The increasing number of models, along with evaluations varying in datasets, metrics, and compared models, makes direct performance comparison between models difficult and complicates the selection of the most suitable model for specific clinical tasks. In our study, 11 promptable FMs are tested using non-iterative 2D and 3D prompting strategies on a private and public dataset focusing on bone and implant segmentation in four anatomical regions (wrist, shoulder, hip and lower leg). The Pareto-optimal models are identified and further analyzed using human prompts collected through a dedicated observer study. Our findings are: 1) The segmentation performance varies a lot between FMs and prompting strategies; 2) The Pareto-optimal models in 2D are…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMedical Imaging and Analysis · Artificial Intelligence in Healthcare and Education · Advanced X-ray and CT Imaging
