P-values for classification

Lutz Duembgen; Bernd-Wolfgang Igl; Axel Munk

arXiv:0801.2934·math.ST·June 26, 2008

P-values for classification

Lutz Duembgen, Bernd-Wolfgang Igl, Axel Munk

PDF

TL;DR

This paper introduces a method to generate p-values for classification tasks, providing confidence measures for class predictions, which enhances the interpretability and reliability of classifiers.

Contribution

The paper proposes a novel approach to produce nonparametric p-values for each class, transforming point predictions into confidence regions, and discusses its advantages over traditional methods.

Findings

01

P-values offer confidence measures for class predictions.

02

Any reasonable classifier can be adapted to produce these p-values.

03

The approach improves interpretability and reliability of classification results.

Abstract

Let $(X, Y)$ be a random variable consisting of an observed feature vector $X \in X$ and an unobserved class label $Y \in {1, 2, ..., L}$ with unknown joint distribution. In addition, let $D$ be a training data set consisting of $n$ completely observed independent copies of $(X, Y)$ . Usual classification procedures provide point predictors (classifiers) $Y (X, D)$ of $Y$ or estimate the conditional distribution of $Y$ given $X$ . In order to quantify the certainty of classifying $X$ we propose to construct for each $θ = 1, 2, ..., L$ a p-value $π_{θ} (X, D)$ for the null hypothesis that $Y = θ$ , treating $Y$ temporarily as a fixed parameter. In other words, the point predictor $Y (X, D)$ is replaced with a prediction region for $Y$ with a certain confidence. We argue that (i) this approach is advantageous over…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.