Continuous Sweep for Binary Quantification Learning
Kevin Kloos, Julian D. Karch, Quinten A. Meertens, Mark de Rooij

TL;DR
This paper introduces Continuous Sweep, a new binary quantifier inspired by Median Sweep, with modifications allowing theoretical analysis and improved performance in estimating class prevalence.
Contribution
The paper proposes Continuous Sweep, a novel parametric binary quantifier with theoretical bias-variance analysis and superior empirical performance over existing methods.
Findings
Continuous Sweep outperforms Median Sweep and other quantifiers in simulations.
Theoretical bias and variance expressions enable optimal decision boundary determination.
Empirical results show competitive performance on real datasets.
Abstract
A quantifier is a supervised machine learning algorithm, focused on estimating the class prevalence in a dataset rather than labeling its individual observations. We introduce Continuous Sweep, a new parametric binary quantifier inspired by the well-performing Median Sweep, which is an ensemble method based on Adjusted Count estimators. We modified two aspects of Median Sweep: 1) using parametric class distributions instead of empirical distributions for the true and false positive rate; 2) using the mean instead of the median of a set of Adjusted Count estimates. These two modifications allow for a theoretical analysis of the bias and variance of Continuous Sweep. Furthermore, the expressions of bias and variance can be used to define optimal decision boundaries of the set of Adjusted count estimates to be used in the ensemble. We show in three simulation studies that Continuous Sweep…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification · Statistical Methods and Inference · Bayesian Modeling and Causal Inference
