Open-Set Domain Adaptation with Visual-Language Foundation Models

Qing Yu; Go Irie; Kiyoharu Aizawa

arXiv:2307.16204·cs.CV·August 1, 2023

Open-Set Domain Adaptation with Visual-Language Foundation Models

Qing Yu, Go Irie, Kiyoharu Aizawa

PDF

Open Access

TL;DR

This paper leverages visual-language foundation models like CLIP for open-set domain adaptation, introducing an entropy optimization strategy that enhances zero-shot prediction and achieves state-of-the-art results on benchmarks.

Contribution

It proposes a novel method to adapt CLIP for open-set domain adaptation, improving performance without extensive fine-tuning.

Findings

01

CLIP-based zero-shot prediction performs well in ODA tasks

02

Entropy optimization enhances ODA model accuracy

03

Achieves state-of-the-art results on multiple benchmarks

Abstract

Unsupervised domain adaptation (UDA) has proven to be very effective in transferring knowledge obtained from a source domain with labeled data to a target domain with unlabeled data. Owing to the lack of labeled data in the target domain and the possible presence of unknown classes, open-set domain adaptation (ODA) has emerged as a potential solution to identify these classes during the training phase. Although existing ODA approaches aim to solve the distribution shifts between the source and target domains, most methods fine-tuned ImageNet pre-trained models on the source domain with the adaptation on the target domain. Recent visual-language foundation models (VLFM), such as Contrastive Language-Image Pre-Training (CLIP), are robust to many distribution shifts and, therefore, should substantially improve the performance of ODA. In this work, we explore generic ways to adopt CLIP, a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications · Cancer-related molecular mechanisms research

MethodsContrastive Language-Image Pre-training