COSMo: CLIP Talks on Open-Set Multi-Target Domain Adaptation

Munish Monga; Sachin Kumar Giroh; Ankit Jha; Mainak Singha; Biplab; Banerjee; Jocelyn Chanussot

arXiv:2409.00397·cs.CV·December 17, 2024

COSMo: CLIP Talks on Open-Set Multi-Target Domain Adaptation

Munish Monga, Sachin Kumar Giroh, Ankit Jha, Mainak Singha, Biplab, Banerjee, Jocelyn Chanussot

PDF

Open Access 1 Repo

TL;DR

COSMo introduces a novel prompt learning approach using CLIP for open-set multi-target domain adaptation, effectively handling domain and class shifts with improved performance on multiple datasets.

Contribution

It is the first method to address open-set multi-target domain adaptation by learning domain-agnostic prompts guided by source domain information in the prompt space.

Findings

01

Achieves an average of 5.1% improvement over existing methods on three datasets.

02

Effectively handles both open-set and multi-target domain shifts.

03

Demonstrates the potential of CLIP-based prompt learning for complex domain adaptation tasks.

Abstract

Multi-Target Domain Adaptation (MTDA) entails learning domain-invariant information from a single source domain and applying it to multiple unlabeled target domains. Yet, existing MTDA methods predominantly focus on addressing domain shifts within visual features, often overlooking semantic features and struggling to handle unknown classes, resulting in what is known as Open-Set (OS) MTDA. While large-scale vision-language foundation models like CLIP show promise, their potential for MTDA remains largely unexplored. This paper introduces COSMo, a novel method that learns domain-agnostic prompts through source domain-guided prompt learning to tackle the MTDA problem in the prompt space. By leveraging a domain-specific bias network and separate prompts for known and unknown classes, COSMo effectively adapts across domain and class shifts. To the best of our knowledge, COSMo is the first…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

munish30monga/COSMo
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling

MethodsContrastive Language-Image Pre-training · Focus