Revisiting Unknowns: Towards Effective and Efficient Open-Set Active Learning
Chen-Chen Zong, Yu-Qi Chi, Xie-Yang Wang, Yan Cui, Sheng-Jun Huang

TL;DR
This paper introduces E$^2$OAL, a unified, detector-free open-set active learning framework that effectively leverages labeled unknowns to improve learning efficiency and accuracy in scenarios with unseen classes.
Contribution
E$^2$OAL is the first to fully exploit labeled unknowns in a unified, detector-free framework, enhancing supervision and query reliability in open-set active learning.
Findings
E$^2$OAL outperforms state-of-the-art methods in accuracy and efficiency.
The framework demonstrates robustness with minimal hyperparameter sensitivity.
Extensive experiments validate its effectiveness across multiple benchmarks.
Abstract
Open-set active learning (OSAL) aims to identify informative samples for annotation when unlabeled data may contain previously unseen classes-a common challenge in safety-critical and open-world scenarios. Existing approaches typically rely on separately trained open-set detectors, introducing substantial training overhead and overlooking the supervisory value of labeled unknowns for improving known-class learning. In this paper, we propose EOAL (Effective and Efficient Open-set Active Learning), a unified and detector-free framework that fully exploits labeled unknowns for both stronger supervision and more reliable querying. EOAL first uncovers the latent class structure of unknowns through label-guided clustering in a frozen contrastively pre-trained feature space, optimized by a structure-aware F1-product objective. To leverage labeled unknowns, it employs a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Algorithms · Domain Adaptation and Few-Shot Learning · Machine Learning and Data Classification
