Tell2Adapt: A Unified Framework for Source Free Unsupervised Domain Adaptation via Vision Foundation Model

Yulong Shi; Shijie Li; Ziyi Li; Lin Qi

arXiv:2603.05012·cs.CV·March 6, 2026

Tell2Adapt: A Unified Framework for Source Free Unsupervised Domain Adaptation via Vision Foundation Model

Yulong Shi, Shijie Li, Ziyi Li, Lin Qi

PDF

Open Access

TL;DR

Tell2Adapt is a novel source-free unsupervised domain adaptation framework that leverages Vision Foundation Models to improve medical image segmentation across diverse clinical domains with high reliability.

Contribution

It introduces a unified SFUDA framework utilizing Vision Foundation Models with novel prompts regularization and visual plausibility refinement for better generalization and accuracy.

Findings

01

Outperforms existing methods across 10 domain directions and 22 anatomical targets.

02

Achieves state-of-the-art results in medical image segmentation.

03

Validated on diverse clinical datasets including brain, cardiac, and abdominal images.

Abstract

Source Free Unsupervised Domain Adaptation (SFUDA) is critical for deploying deep learning models across diverse clinical settings. However, existing methods are typically designed for low-gap, specific domain shifts and cannot generalize into a unified, multi-modalities, and multi-target framework, which presents a major barrier to real-world application. To overcome this issue, we introduce Tell2Adapt, a novel SFUDA framework that harnesses the vast, generalizable knowledge of the Vision Foundation Model (VFM). Our approach ensures high-fidelity VFM prompts through Context-Aware Prompts Regularization (CAPR), which robustly translates varied text prompts into canonical instructions. This enables the generation of high-quality pseudo-labels for efficiently adapting the lightweight student model to target domain. To guarantee clinical reliability, the framework incorporates Visual…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications · Face recognition and analysis