Preserving Clusters in Prompt Learning for Unsupervised Domain Adaptation

Tung-Long Vuong; Hoang Phan; Vy Vo; Anh Bui; Thanh-Toan Do; Trung Le; Dinh Phung

arXiv:2506.11493·cs.CV·June 16, 2025

Preserving Clusters in Prompt Learning for Unsupervised Domain Adaptation

Tung-Long Vuong, Hoang Phan, Vy Vo, Anh Bui, Thanh-Toan Do, Trung Le, Dinh Phung

PDF

Open Access

TL;DR

This paper introduces a novel method for unsupervised domain adaptation using multi-modal models, focusing on preserving cluster structures in visual and text embeddings to improve target domain alignment and prompt quality.

Contribution

It proposes a new approach leveraging embedding geometry and optimal transport to reinforce pseudo-labels and enhance clustering in multi-modal prompt learning for UDA.

Findings

01

Improved target domain alignment in experiments.

02

Enhanced quality of target prompts.

03

Superior performance over existing methods.

Abstract

Recent approaches leveraging multi-modal pre-trained models like CLIP for Unsupervised Domain Adaptation (UDA) have shown significant promise in bridging domain gaps and improving generalization by utilizing rich semantic knowledge and robust visual representations learned through extensive pre-training on diverse image-text datasets. While these methods achieve state-of-the-art performance across benchmarks, much of the improvement stems from base pseudo-labels (CLIP zero-shot predictions) and self-training mechanisms. Thus, the training mechanism exhibits a key limitation wherein the visual embedding distribution in target domains can deviate from the visual embedding distribution in the pre-trained model, leading to misguided signals from class descriptions. This work introduces a fresh solution to reinforce these pseudo-labels and facilitate target-prompt learning, by exploiting the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning