Do Vision Foundation Models Enhance Domain Generalization in Medical   Image Segmentation?

Kerem Cekmeceli; Meva Himmetoglu; Guney I. Tombak; Anna Susmelj,; Ertunc Erdil; Ender Konukoglu

arXiv:2409.07960·cs.CV·September 13, 2024

Do Vision Foundation Models Enhance Domain Generalization in Medical Image Segmentation?

Kerem Cekmeceli, Meva Himmetoglu, Guney I. Tombak, Anna Susmelj,, Ertunc Erdil, Ender Konukoglu

PDF

Open Access 1 Repo

TL;DR

This study evaluates the ability of vision foundation models, enhanced with novel fine-tuning techniques and a new decoder head, to improve domain generalization in medical image segmentation across diverse datasets and clinical settings.

Contribution

It introduces a new decode head architecture, HQHSAM, and systematically assesses the domain generalization capabilities of various foundation models with different fine-tuning methods in medical segmentation.

Findings

01

FMs with HQHSAM improve segmentation across domains.

02

PEFT techniques' effectiveness varies among models.

03

FMs show promise for robust medical image segmentation.

Abstract

Neural networks achieve state-of-the-art performance in many supervised learning tasks when the training data distribution matches the test data distribution. However, their performance drops significantly under domain (covariate) shift, a prevalent issue in medical image segmentation due to varying acquisition settings across different scanner models and protocols. Recently, foundational models (FMs) trained on large datasets have gained attention for their ability to be adapted for downstream tasks and achieve state-of-the-art performance with excellent generalization capabilities on natural images. However, their effectiveness in medical image segmentation remains underexplored. In this paper, we investigate the domain generalization performance of various FMs, including DinoV2, SAM, MedSAM, and MAE, when fine-tuned using various parameter-efficient fine-tuning (PEFT) techniques such…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

kerem-cekmeceli/foundation-models-for-medical-imagery
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsImage Retrieval and Classification Techniques · AI in cancer detection · Domain Adaptation and Few-Shot Learning

MethodsSoftmax · Attention Is All You Need · Masked autoencoder · Segment Anything Model