COCA: Classifier-Oriented Calibration via Textual Prototype for Source-Free Universal Domain Adaptation
Xinghong Liu, Yi Zhou, Tao Zhou, Chun-Mei Feng, Ling Shao

TL;DR
This paper introduces COCA, a classifier-oriented calibration method leveraging textual prototypes and vision-language models to improve source-free universal domain adaptation, reducing labeling costs and enhancing unknown class detection.
Contribution
COCA is a novel plug-and-play calibration approach that uses textual prototypes with VLMs, focusing on classifiers rather than image encoder optimization for SF-UniDA.
Findings
COCA outperforms existing UniDA and SF-UniDA methods in experiments.
It enables few-shot learners to distinguish unknown classes effectively.
COCA reduces the need for extensive labeled source data.
Abstract
Universal domain adaptation (UniDA) aims to address domain and category shifts across data sources. Recently, due to more stringent data restrictions, researchers have introduced source-free UniDA (SF-UniDA). SF-UniDA methods eliminate the need for direct access to source samples when performing adaptation to the target domain. However, existing SF-UniDA methods still require an extensive quantity of labeled source samples to train a source model, resulting in significant labeling costs. To tackle this issue, we present a novel plug-and-play classifier-oriented calibration (COCA) method. COCA, which exploits textual prototypes, is designed for the source models based on few-shot learning with vision-language models (VLMs). It endows the VLM-powered few-shot learners, which are built for closed-set classification, with the unknown-aware ability to distinguish common and unknown classes…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning
