Open Category Detection with PAC Guarantees
Si Liu, Risheek Garrepalli, Thomas G. Dietterich, Alan Fern, Dan, Hendrycks

TL;DR
This paper introduces a PAC-guaranteed algorithm for open category detection that effectively identifies alien instances in contaminated datasets, addressing key theoretical and empirical gaps in the field.
Contribution
It proposes the first algorithm with PAC guarantees for alien detection in contaminated training data, balancing detection accuracy and false alarms.
Findings
Algorithm achieves PAC-style guarantees on detection rate
Empirical results validate effectiveness on synthetic and benchmark datasets
Provides a baseline for future research in open category detection
Abstract
Open category detection is the problem of detecting "alien" test instances that belong to categories or classes that were not present in the training data. In many applications, reliably detecting such aliens is central to ensuring the safety and accuracy of test set predictions. Unfortunately, there are no algorithms that provide theoretical guarantees on their ability to detect aliens under general assumptions. Further, while there are algorithms for open category detection, there are few empirical results that directly report alien detection rates. Thus, there are significant theoretical and empirical gaps in our understanding of open category detection. In this paper, we take a step toward addressing this gap by studying a simple, but practically-relevant variant of open category detection. In our setting, we are provided with a "clean" training set that contains only the target…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Anomaly Detection Techniques and Applications · Machine Learning and Algorithms
