Predicting Classification Accuracy When Adding New Unobserved Classes

Yuli Slavutsky; Yuval Benjamini

arXiv:2010.15011·cs.LG·May 29, 2024

Predicting Classification Accuracy When Adding New Unobserved Classes

Yuli Slavutsky, Yuval Benjamini

PDF

Open Access 1 Video

TL;DR

This paper introduces a method to predict the accuracy of multiclass classifiers on unseen classes by leveraging a novel measure called reversed ROC, enabling better extrapolation from limited class samples.

Contribution

The authors propose a new measure, rROC, and a neural network-based algorithm, CleaneX, for accurately estimating classifier accuracy on large, unobserved class sets.

Findings

01

CleaneX outperforms existing methods in accuracy prediction.

02

The rROC measure effectively relates classifier scores to accuracy.

03

Method validated on object detection, face recognition, and brain decoding datasets.

Abstract

Multiclass classifiers are often designed and evaluated only on a sample from the classes on which they will eventually be applied. Hence, their final accuracy remains unknown. In this work we study how a classifier's performance over the initial class sample can be used to extrapolate its expected accuracy on a larger, unobserved set of classes. For this, we define a measure of separation between correct and incorrect classes that is independent of the number of classes: the "reversed ROC" (rROC), which is obtained by replacing the roles of classes and data-points in the common ROC. We show that the classification accuracy is a function of the rROC in multiclass classifiers, for which the learned representation of data from the initial class sample remains unchanged when new classes are added. Using these results we formulate a robust neural-network-based algorithm, "CleaneX", which…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Predicting Classification Accuracy When Adding New Unobserved Classes· slideslive

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Anomaly Detection Techniques and Applications · Imbalanced Data Classification Techniques