Vision Foundation Models in Medical Image Analysis: Advances and   Challenges

Pengchen Liang; Bin Pu; Haishan Huang; Yiwei Li; Hualiang Wang; Weibo; Ma; Qing Chang

arXiv:2502.14584·eess.IV·February 24, 2025·2 cites

Vision Foundation Models in Medical Image Analysis: Advances and Challenges

Pengchen Liang, Bin Pu, Haishan Huang, Yiwei Li, Hualiang Wang, Weibo, Ma, Qing Chang

PDF

Open Access

TL;DR

This paper reviews recent advances in Vision Foundation Models like ViT and SAM for medical image segmentation, highlighting challenges, solutions, and future directions to improve clinical applications.

Contribution

It provides a comprehensive overview of how VFMs are adapted to medical imaging, focusing on domain adaptation, model compression, and federated learning techniques.

Findings

01

VFMs show strong potential for medical image segmentation.

02

Challenges include domain differences and limited datasets.

03

Emerging methods like federated learning and knowledge distillation are promising.

Abstract

The rapid development of Vision Foundation Models (VFMs), particularly Vision Transformers (ViT) and Segment Anything Model (SAM), has sparked significant advances in the field of medical image analysis. These models have demonstrated exceptional capabilities in capturing long-range dependencies and achieving high generalization in segmentation tasks. However, adapting these large models to medical image analysis presents several challenges, including domain differences between medical and natural images, the need for efficient model adaptation strategies, and the limitations of small-scale medical datasets. This paper reviews the state-of-the-art research on the adaptation of VFMs to medical image segmentation, focusing on the challenges of domain adaptation, model compression, and federated learning. We discuss the latest developments in adapter-based improvements, knowledge…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsBrain Tumor Detection and Classification · Advanced Image Fusion Techniques · Image Retrieval and Classification Techniques

MethodsKnowledge Distillation