OpenworldAUC: Towards Unified Evaluation and Optimization for Open-world Prompt Tuning
Cong Hua, Qianqian Xu, Zhiyong Yang, Zitai Wang, Shilong, Bao, Qingming Huang

TL;DR
This paper introduces OpenworldAUC, a unified evaluation metric for open-world prompt tuning that assesses both detection and classification, and proposes Gated Mixture-of-Prompts (GMoP) for optimized performance, demonstrating state-of-the-art results across benchmarks.
Contribution
The paper proposes OpenworldAUC as a new unified metric and GMoP as an optimization method for open-world prompt tuning, addressing evaluation and training challenges.
Findings
OpenworldAUC effectively evaluates detection and classification jointly.
GMoP achieves state-of-the-art performance on 15 benchmarks.
The method is insensitive to varying base/new sample ratios.
Abstract
Prompt tuning adapts Vision-Language Models like CLIP to open-world tasks with minimal training costs. In this direction, one typical paradigm evaluates model performance separately on known classes (i.e., base domain) and unseen classes (i.e., new domain). However, real-world scenarios require models to handle inputs without prior domain knowledge. This practical challenge has spurred the development of open-world prompt tuning, which demands a unified evaluation of two stages: 1) detecting whether an input belongs to the base or new domain (P1), and 2) classifying the sample into its correct class (P2). What's more, as domain distributions are generally unknown, a proper metric should be insensitive to varying base/new sample ratios (P3). However, we find that current metrics, including HM, overall accuracy, and AUROC, fail to satisfy these three properties simultaneously. To bridge…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsMultimodal Machine Learning Applications · Domain Adaptation and Few-Shot Learning · Advanced Neural Network Applications
MethodsBalanced Selection · Contrastive Language-Image Pre-training
