TL;DR
MedSAMix is a training-free, model merging approach that combines generalist and specialist medical image segmentation models, improving accuracy and generalization across diverse tasks without additional training.
Contribution
It introduces a zero-order optimization method for automatic layer-wise model merging, enhancing medical segmentation performance without retraining.
Findings
Achieves 6.67% improvement on specialized tasks
Achieves 4.37% improvement on multi-task evaluations
Effectively mitigates model bias and enhances generalization
Abstract
Universal medical image segmentation models have emerged as a promising paradigm due to their strong generalizability across diverse tasks, showing great potential for a wide range of clinical applications. This potential has been partly driven by the success of general-purpose vision models such as the Segment Anything Model (SAM), which has inspired the development of various fine-tuned variants for medical segmentation tasks. However, fine-tuned variants like MedSAM are trained on comparatively limited medical imaging data that often suffers from heterogeneity, scarce annotations, and distributional shifts. These challenges limit their ability to generalize across a wide range of medical segmentation tasks. In this regard, we propose MedSAMix, a training-free model merging method that integrates the strengths of both generalist models (e.g., SAM) and specialist models (e.g., MedSAM)…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
