The PAV algorithm optimizes binary proper scoring rules

Niko Brummer; Johan du Preez

arXiv:1304.2331·stat.AP·April 11, 2013·27 cites

The PAV algorithm optimizes binary proper scoring rules

Niko Brummer, Johan du Preez

PDF

Open Access

TL;DR

The paper demonstrates that the PAV algorithm optimally calibrates binary classifiers under all regular proper scoring rules, extending previous convexity limitations and applying to probabilities and log-likelihood ratios.

Contribution

It proves the optimality of the PAV algorithm for binary calibration under all regular proper scoring rules, beyond convex cases, including log-likelihood ratios.

Findings

01

PAV solves non-parametric calibration with monotonicity constraints.

02

Optimality holds for all regular binary proper scoring rules.

03

Results apply to probabilities and log-likelihood ratios.

Abstract

There has been much recent interest in application of the pool-adjacent-violators (PAV) algorithm for the purpose of calibrating the probabilistic outputs of automatic pattern recognition and machine learning algorithms. Special cost functions, known as proper scoring rules form natural objective functions to judge the goodness of such calibration. We show that for binary pattern classifiers, the non-parametric optimization of calibration, subject to a monotonicity constraint, can be solved by PAV and that this solution is optimal for all regular binary proper scoring rules. This extends previous results which were limited to convex binary proper scoring rules. We further show that this result holds not only for calibration of probabilities, but also for calibration of log-likelihood-ratios, in which case optimality holds independently of the prior probabilities of the pattern classes.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsImbalanced Data Classification Techniques · Machine Learning and Data Classification · Bayesian Modeling and Causal Inference