A Bayesian Approach for Accurate Classification-Based Aggregates
Q. A. Meertens, C. G. H. Diks, H. J. van den Herik, F W Takes

TL;DR
This paper introduces a Bayesian bias correction method for classification-based aggregate estimates, effectively addressing estimation inaccuracies and negative value issues, especially in small sample scenarios with class imbalance.
Contribution
The paper presents a novel Bayesian inference approach that incorporates parameter constraints to improve bias correction in classification-based aggregate estimation.
Findings
Outperforms existing methods in mean squared error on real-world data
Effectively handles class imbalance and small sample size issues
Addresses estimation inaccuracies and negative estimate problems
Abstract
In this paper, we study the accuracy of values aggregated over classes predicted by a classification algorithm. The problem is that the resulting aggregates (e.g., sums of a variable) are known to be biased. The bias can be large even for highly accurate classification algorithms, in particular when dealing with class-imbalanced data. To correct this bias, the algorithm's classification error rates have to be estimated. In this estimation, two issues arise when applying existing bias correction methods. First, inaccuracies in estimating classification error rates have to be taken into account. Second, impermissible estimates, such as a negative estimate for a positive value, have to be dismissed. We show that both issues are relevant in applications where the true labels are known only for a small set of data points. We propose a novel bias correction method using Bayesian inference.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Bayesian Inference · Advanced Statistical Methods and Models · Statistical Methods and Inference
