Driving down Poisson error can offset classification error in clinical tasks
Charles B. Delahunt, Courosh Mehanian, and Matthew P. Horning

TL;DR
This paper explores how increasing sample size in medical ML tasks can offset classification errors by reducing Poisson variability, improving clinical performance in tasks like malaria diagnosis.
Contribution
It introduces a mathematical framework linking Poisson error, classification error, and total error, enabling optimization of ML systems through sample size adjustments.
Findings
Larger sample sizes can compensate for lower classification accuracy.
Poisson variability decreases with increased sample size, improving estimate reliability.
Mathematical toolkit aids in designing ML systems for clinical tasks.
Abstract
Medical machine learning algorithms are typically evaluated based on accuracy vs. a clinician-defined ground truth, a reasonable initial choice since trained clinicians are usually better classifiers than ML models. However, this metric does not fully capture the actual clinical task: it neglects the fact that humans, even with perfect accuracy, are subject to non-trivial error from the Poisson statistics of rare events, because clinical protocols often specify a relatively small sample size. For example, to quantitate malaria on a thin blood film a clinician examines only 2000 red blood cells (0.0004 uL), which can yield large Poisson variation in the actual number of parasites present, so that a perfect human's count can differ substantially from the true average load. In contrast, an ML system may be less accurate on an object level, but it may also have the option to examine more…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Healthcare
