AD-CLIP: Adapting Domains in Prompt Space Using CLIP
Mainak Singha, Harsh Pal, Ankit Jha, Biplab Banerjee

TL;DR
AD-CLIP introduces a domain-agnostic prompt learning method leveraging CLIP's vision-language model to improve unsupervised domain adaptation by aligning domain and class features in the prompt space.
Contribution
It proposes a novel prompt learning strategy in CLIP for domain adaptation, including style-content conditioning and a style mapping network for target-only scenarios.
Findings
Outperforms existing domain adaptation methods on benchmark datasets.
Effectively aligns source and target domains in embedding space.
Demonstrates robustness in target-only testing scenarios.
Abstract
Although deep learning models have shown impressive performance on supervised learning tasks, they often struggle to generalize well when the training (source) and test (target) domains differ. Unsupervised domain adaptation (DA) has emerged as a popular solution to this problem. However, current DA techniques rely on visual backbones, which may lack semantic richness. Despite the potential of large-scale vision-language foundation models like CLIP, their effectiveness for DA has yet to be fully explored. To address this gap, we introduce \textsc{AD-CLIP}, a domain-agnostic prompt learning strategy for CLIP that aims to solve the DA problem in the prompt space. We leverage the frozen vision backbone of CLIP to extract both image style (domain) and content information, which we apply to learn prompt tokens. Our prompts are designed to be domain-invariant and class-generalizable, by…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Cancer-related molecular mechanisms research · Multimodal Machine Learning Applications
MethodsContrastive Learning · Contrastive Language-Image Pre-training · ALIGN
