P-values for classification
Lutz Duembgen, Bernd-Wolfgang Igl, Axel Munk

TL;DR
This paper introduces a method to generate p-values for classification tasks, providing confidence measures for class predictions, which enhances the interpretability and reliability of classifiers.
Contribution
The paper proposes a novel approach to produce nonparametric p-values for each class, transforming point predictions into confidence regions, and discusses its advantages over traditional methods.
Findings
P-values offer confidence measures for class predictions.
Any reasonable classifier can be adapted to produce these p-values.
The approach improves interpretability and reliability of classification results.
Abstract
Let be a random variable consisting of an observed feature vector and an unobserved class label with unknown joint distribution. In addition, let be a training data set consisting of completely observed independent copies of . Usual classification procedures provide point predictors (classifiers) of or estimate the conditional distribution of given . In order to quantify the certainty of classifying we propose to construct for each a p-value for the null hypothesis that , treating temporarily as a fixed parameter. In other words, the point predictor is replaced with a prediction region for with a certain confidence. We argue that (i) this approach is advantageous over…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
