Inequalities for Optimization of Classification Algorithms: A Perspective Motivated by Diagnostic Testing
Paul N. Patrone, Anthony J. Kearsley

TL;DR
This paper introduces a novel objective function based on the Gershgorin circle theorem to provide uniform error bounds for classification and prevalence estimation, motivated by medical diagnostics.
Contribution
It presents a set-theoretic approach and a measure-theoretic optimization to minimize the Gershgorin radius for improved error bounds in classifiers.
Findings
Gershgorin radius bounds errors in classification and prevalence estimation.
Optimal partitioning minimizes the Gershgorin radius in binary classification.
Multi-class extension presents additional challenges and properties.
Abstract
Motivated by canonical problems in medical diagnostics, we propose and study properties of an objective function that uniformly bounds uncertainties in quantities of interest extracted from classifiers and related data analysis tools. We begin by adopting a set-theoretic perspective to show how two main tasks in diagnostics -- classification and prevalence estimation -- can be recast in terms of a variation on the confusion (or error) matrix typically considered in supervised learning. We then combine arguments from conditional probability with the Gershgorin circle theorem to demonstrate that the largest Gershgorin radius of the matrix (where is the identity) yields uniform error bounds for both classification and prevalence estimation. In a two-class setting, is minimized via…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Algorithms · Bayesian Modeling and Causal Inference · Statistical Methods and Inference
