Inequalities for Optimization of Classification Algorithms: A Perspective Motivated by Diagnostic Testing

Paul N. Patrone; Anthony J. Kearsley

arXiv:2508.01065·stat.ML·August 5, 2025

Inequalities for Optimization of Classification Algorithms: A Perspective Motivated by Diagnostic Testing

Paul N. Patrone, Anthony J. Kearsley

PDF

Open Access

TL;DR

This paper introduces a novel objective function based on the Gershgorin circle theorem to provide uniform error bounds for classification and prevalence estimation, motivated by medical diagnostics.

Contribution

It presents a set-theoretic approach and a measure-theoretic optimization to minimize the Gershgorin radius for improved error bounds in classifiers.

Findings

01

Gershgorin radius bounds errors in classification and prevalence estimation.

02

Optimal partitioning minimizes the Gershgorin radius in binary classification.

03

Multi-class extension presents additional challenges and properties.

Abstract

Motivated by canonical problems in medical diagnostics, we propose and study properties of an objective function that uniformly bounds uncertainties in quantities of interest extracted from classifiers and related data analysis tools. We begin by adopting a set-theoretic perspective to show how two main tasks in diagnostics -- classification and prevalence estimation -- can be recast in terms of a variation on the confusion (or error) matrix $P$ typically considered in supervised learning. We then combine arguments from conditional probability with the Gershgorin circle theorem to demonstrate that the largest Gershgorin radius $ρ_{m}$ of the matrix $I - P$ (where $I$ is the identity) yields uniform error bounds for both classification and prevalence estimation. In a two-class setting, $ρ_{m}$ is minimized via…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Algorithms · Bayesian Modeling and Causal Inference · Statistical Methods and Inference