Multi-Prompt Progressive Alignment for Multi-Source Unsupervised Domain Adaptation
Haoran Chen, Zexiao Wang, Haidong Cao, Zuxuan Wu, and Yu-Gang Jiang

TL;DR
This paper introduces MP^2A, a progressive alignment method for multi-source unsupervised domain adaptation using CLIP, which gradually incorporates target samples to improve robustness and achieve state-of-the-art results.
Contribution
The paper proposes a novel progressive alignment strategy for CLIP-based multi-source UDA, addressing noise and domain gap issues more effectively than existing one-shot methods.
Findings
MP^2A outperforms recent CLIP-based MS-UDA methods on benchmarks.
Gradual sample incorporation improves domain-invariant feature learning.
Method demonstrates robustness against noisy and hard-to-classify samples.
Abstract
Large Vision-Language Models like CLIP have become a powerful foundation for Unsupervised Domain Adaptation due to their strong zero-shot generalization. State-of-the-art methods typically leverage CLIP to generate pseudo-labels for the target domain, then fine-tune the model to learn domain-invariant features. However, these methods attempt to align source and target domains using all pseudo-labeled data simultaneously. This one-shot alignment struggles with noisy, hard-to-classify samples, leading to error propagation and suboptimal feature learning. The problem is even more amplified in the multi-source scenario, where diverse domain gaps and varying noise levels across multiple source domains further destabilize the alignment process. To address this issue, in this work, we propose a progressive alignment strategy for adapting CLIP to unlabeled downstream task. Our method begins by…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications
