Domain Prompt Learning for Efficiently Adapting CLIP to Unseen Domains
Xin Zhang, Shixiang Shane Gu, Yutaka Matsuo, Yusuke Iwasawa

TL;DR
This paper introduces Domain Prompt Learning (DPL), a lightweight method to adapt CLIP for unseen domain generalization in image classification, significantly improving zero-shot accuracy across multiple benchmarks.
Contribution
The paper proposes DPL, a novel prompt generation approach, enabling effective domain adaptation of CLIP without extensive fine-tuning, enhancing zero-shot performance in DG tasks.
Findings
DPL improves zero-shot CLIP accuracy from 73.7% to 79.3%.
DPL achieves significant gains on PACS, VLCS, OfficeHome, TerraIncognita.
A lightweight prompt generator suffices for effective domain adaptation.
Abstract
Domain generalization (DG) is a difficult transfer learning problem aiming to learn a generalizable model for unseen domains. Recent foundation models (FMs) are robust to many distribution shifts and, therefore, should substantially improve the performance of DG. In this work, we study generic ways to adopt CLIP, a Visual-Language Foundation Model, for DG problems in image classification. While ERM greatly improves the accuracy with bigger backbones and training datasets using standard DG benchmarks, fine-tuning FMs is not practical in many real-world situations. We propose Domain Prompt Learning (DPL) as a novel approach for domain inference in the form of conditional prompt generation. DPL achieved a significant accuracy improvement with only training a lightweight prompt generator (a three-layer MLP), whose parameter is of equivalent scale to the classification projector in the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications · COVID-19 diagnosis using AI
Methods15 Ways to Contact How can i speak to someone at Delta Airlines · Multi-Head Attention · Attention Is All You Need · Linear Layer · Cosine Annealing · Softmax · Residual Connection · {Dispute@FaQ-s}How to file a dispute with Expedia? · Linear Warmup With Cosine Annealing · Layer Normalization
