Multi-Prompt Progressive Alignment for Multi-Source Unsupervised Domain Adaptation

Haoran Chen; Zexiao Wang; Haidong Cao; Zuxuan Wu; and Yu-Gang Jiang

arXiv:2507.23373·cs.CV·August 1, 2025

Multi-Prompt Progressive Alignment for Multi-Source Unsupervised Domain Adaptation

Haoran Chen, Zexiao Wang, Haidong Cao, Zuxuan Wu, and Yu-Gang Jiang

PDF

Open Access

TL;DR

This paper introduces MP^2A, a progressive alignment method for multi-source unsupervised domain adaptation using CLIP, which gradually incorporates target samples to improve robustness and achieve state-of-the-art results.

Contribution

The paper proposes a novel progressive alignment strategy for CLIP-based multi-source UDA, addressing noise and domain gap issues more effectively than existing one-shot methods.

Findings

01

MP^2A outperforms recent CLIP-based MS-UDA methods on benchmarks.

02

Gradual sample incorporation improves domain-invariant feature learning.

03

Method demonstrates robustness against noisy and hard-to-classify samples.

Abstract

Large Vision-Language Models like CLIP have become a powerful foundation for Unsupervised Domain Adaptation due to their strong zero-shot generalization. State-of-the-art methods typically leverage CLIP to generate pseudo-labels for the target domain, then fine-tune the model to learn domain-invariant features. However, these methods attempt to align source and target domains using all pseudo-labeled data simultaneously. This one-shot alignment struggles with noisy, hard-to-classify samples, leading to error propagation and suboptimal feature learning. The problem is even more amplified in the multi-source scenario, where diverse domain gaps and varying noise levels across multiple source domains further destabilize the alignment process. To address this issue, in this work, we propose a progressive alignment strategy for adapting CLIP to unlabeled downstream task. Our method begins by…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications