AD-CLIP: Adapting Domains in Prompt Space Using CLIP

Mainak Singha; Harsh Pal; Ankit Jha; Biplab Banerjee

arXiv:2308.05659·cs.CV·September 17, 2024·1 cites

AD-CLIP: Adapting Domains in Prompt Space Using CLIP

Mainak Singha, Harsh Pal, Ankit Jha, Biplab Banerjee

PDF

Open Access 1 Repo

TL;DR

AD-CLIP introduces a domain-agnostic prompt learning method leveraging CLIP's vision-language model to improve unsupervised domain adaptation by aligning domain and class features in the prompt space.

Contribution

It proposes a novel prompt learning strategy in CLIP for domain adaptation, including style-content conditioning and a style mapping network for target-only scenarios.

Findings

01

Outperforms existing domain adaptation methods on benchmark datasets.

02

Effectively aligns source and target domains in embedding space.

03

Demonstrates robustness in target-only testing scenarios.

Abstract

Although deep learning models have shown impressive performance on supervised learning tasks, they often struggle to generalize well when the training (source) and test (target) domains differ. Unsupervised domain adaptation (DA) has emerged as a popular solution to this problem. However, current DA techniques rely on visual backbones, which may lack semantic richness. Despite the potential of large-scale vision-language foundation models like CLIP, their effectiveness for DA has yet to be fully explored. To address this gap, we introduce \textsc{AD-CLIP}, a domain-agnostic prompt learning strategy for CLIP that aims to solve the DA problem in the prompt space. We leverage the frozen vision backbone of CLIP to extract both image style (domain) and content information, which we apply to learn prompt tokens. Our prompts are designed to be domain-invariant and class-generalizable, by…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

mainaksingha01/AD-CLIP
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Cancer-related molecular mechanisms research · Multimodal Machine Learning Applications

MethodsContrastive Learning · Contrastive Language-Image Pre-training · ALIGN