COSMo: CLIP Talks on Open-Set Multi-Target Domain Adaptation
Munish Monga, Sachin Kumar Giroh, Ankit Jha, Mainak Singha, Biplab, Banerjee, Jocelyn Chanussot

TL;DR
COSMo introduces a novel prompt learning approach using CLIP for open-set multi-target domain adaptation, effectively handling domain and class shifts with improved performance on multiple datasets.
Contribution
It is the first method to address open-set multi-target domain adaptation by learning domain-agnostic prompts guided by source domain information in the prompt space.
Findings
Achieves an average of 5.1% improvement over existing methods on three datasets.
Effectively handles both open-set and multi-target domain shifts.
Demonstrates the potential of CLIP-based prompt learning for complex domain adaptation tasks.
Abstract
Multi-Target Domain Adaptation (MTDA) entails learning domain-invariant information from a single source domain and applying it to multiple unlabeled target domains. Yet, existing MTDA methods predominantly focus on addressing domain shifts within visual features, often overlooking semantic features and struggling to handle unknown classes, resulting in what is known as Open-Set (OS) MTDA. While large-scale vision-language foundation models like CLIP show promise, their potential for MTDA remains largely unexplored. This paper introduces COSMo, a novel method that learns domain-agnostic prompts through source domain-guided prompt learning to tackle the MTDA problem in the prompt space. By leveraging a domain-specific bias network and separate prompts for known and unknown classes, COSMo effectively adapts across domain and class shifts. To the best of our knowledge, COSMo is the first…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling
MethodsContrastive Language-Image Pre-training · Focus
