Leveraging Uncertainty Estimates To Improve Classifier Performance
Gundeep Arora, Srujana Merugu, Anoop Saladi, Rajeev Rastogi

TL;DR
This paper introduces methods that incorporate uncertainty estimates into classifier decision boundaries, leading to significant improvements in recall at high precision levels across real-world datasets.
Contribution
It provides a theoretical framework, proves NP-hardness of the joint score-uncertainty decision problem, and develops algorithms that improve classifier performance by leveraging uncertainty.
Findings
25%-40% gain in recall at high precision bounds
Algorithms outperform traditional score-based methods
Theoretical analysis links bias to uncertainty and score
Abstract
Binary classification involves predicting the label of an instance based on whether the model score for the positive class exceeds a threshold chosen based on the application requirements (e.g., maximizing recall for a precision bound). However, model scores are often not aligned with the true positivity rate. This is especially true when the training involves a differential sampling across classes or there is distributional drift between train and test settings. In this paper, we provide theoretical analysis and empirical evidence of the dependence of model score estimation bias on both uncertainty and score itself. Further, we formulate the decision boundary selection in terms of both model score and uncertainty, prove that it is NP-hard, and present algorithms based on dynamic programming and isotonic regression. Evaluation of the proposed algorithms on three real-world datasets…
Peer Reviews
Decision·ICLR 2024 poster
Paper proposes novel theoretical analysis of combining predictive uncertainty and classification score for decision boundary selection. The findings are supported by empirical analysis and versatile set of tests. It is shown that optimisation problem is NP-hard, and the new practical algorithms to approximate the solution with dynamic programming and isotonic regression based approaches, are presented. To my knowledge, the analysis and proposed algorithms for this particular problem are novel co
The proposed approach is motivated by the applications such as medical diagnosis and fraud detection. However, only advertising and e-commerce datasets are considered in the empirical experiment section. It would strengthen the work, if other additional imbalanced datasets from different application domains could have been experimented, as well.
The paper addresses an interesting and important problem—making decisions under uncertainty. It covers the topic in a rather complete way, by first highlighting the relationship between uncertainty and estimation bias, and then proposing a strategy for improving decision-making by taking the uncertainty level into account. The proposal is overall sound, with some theoretical grounding as well as an experimental study so as to support the claims.
My main criticism is that the paper ignores a large part of the literature on decision-making under uncertainty. There already exist many works dedicated to this topic, with the purpose of taking uncertainty into account to improve the decision-making process or abstaining to make decisions when uncertainty is too high. Some of these works are rooted in formalisms alternative to probabilities, while others make use of the classical probabilistic framework. It is obviously not possible to mention
Uncertainty quantification is a hot topic in machine learning. The submission also contains material that is novel.
This paper is not well written. I have the impression that the authors lack a thorough understanding of the literature. They use unconventional notions, and they don't discuss the problem in a formal way. Let me make this point clear. Introduction: - the authors write "benefits of combining model score with estimates of aleatoric and epistemic uncertainty". This is a very weird reasoning, because the model score corresponds to the estimate of aleatoric uncertainty. (btw, better to use conditio
Videos
Taxonomy
TopicsMachine Learning and Data Classification · Anomaly Detection Techniques and Applications · Data Stream Mining Techniques
