Do Vision Foundation Models Enhance Domain Generalization in Medical Image Segmentation?
Kerem Cekmeceli, Meva Himmetoglu, Guney I. Tombak, Anna Susmelj,, Ertunc Erdil, Ender Konukoglu

TL;DR
This study evaluates the ability of vision foundation models, enhanced with novel fine-tuning techniques and a new decoder head, to improve domain generalization in medical image segmentation across diverse datasets and clinical settings.
Contribution
It introduces a new decode head architecture, HQHSAM, and systematically assesses the domain generalization capabilities of various foundation models with different fine-tuning methods in medical segmentation.
Findings
FMs with HQHSAM improve segmentation across domains.
PEFT techniques' effectiveness varies among models.
FMs show promise for robust medical image segmentation.
Abstract
Neural networks achieve state-of-the-art performance in many supervised learning tasks when the training data distribution matches the test data distribution. However, their performance drops significantly under domain (covariate) shift, a prevalent issue in medical image segmentation due to varying acquisition settings across different scanner models and protocols. Recently, foundational models (FMs) trained on large datasets have gained attention for their ability to be adapted for downstream tasks and achieve state-of-the-art performance with excellent generalization capabilities on natural images. However, their effectiveness in medical image segmentation remains underexplored. In this paper, we investigate the domain generalization performance of various FMs, including DinoV2, SAM, MedSAM, and MAE, when fine-tuned using various parameter-efficient fine-tuning (PEFT) techniques such…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage Retrieval and Classification Techniques · AI in cancer detection · Domain Adaptation and Few-Shot Learning
MethodsSoftmax · Attention Is All You Need · Masked autoencoder · Segment Anything Model
