The Certainty Ratio $C_\rho$: a novel metric for assessing the reliability of classifier predictions
Jesus S. Aguilar-Ruiz

TL;DR
The paper introduces the Certainty Ratio ($C_\rho$), a new metric that assesses classifier reliability by distinguishing confident from uncertain predictions, providing deeper insights than traditional accuracy measures.
Contribution
It proposes the Certainty Ratio ($C_\rho$), integrating probabilistic confusion matrices to evaluate classifier trustworthiness more comprehensively.
Findings
$C_\rho$ reveals insights overlooked by traditional metrics.
Experimental validation across 21 datasets and multiple classifiers.
Highlights the importance of probabilistic information in classifier evaluation.
Abstract
Evaluating the performance of classifiers is critical in machine learning, particularly in high-stakes applications where the reliability of predictions can significantly impact decision-making. Traditional performance measures, such as accuracy and F-score, often fail to account for the uncertainty inherent in classifier predictions, leading to potentially misleading assessments. This paper introduces the Certainty Ratio (), a novel metric designed to quantify the contribution of confident (certain) versus uncertain predictions to any classification performance measure. By integrating the Probabilistic Confusion Matrix () and decomposing predictions into certainty and uncertainty components, provides a more comprehensive evaluation of classifier reliability. Experimental results across 21 datasets and multiple classifiers, including Decision Trees,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAnomaly Detection Techniques and Applications · Machine Learning and Data Classification · Explainable Artificial Intelligence (XAI)
