Driving down Poisson error can offset classification error in clinical   tasks

Charles B. Delahunt; Courosh Mehanian; and Matthew P. Horning

arXiv:2405.06065·cs.LG·May 21, 2024

Driving down Poisson error can offset classification error in clinical tasks

Charles B. Delahunt, Courosh Mehanian, and Matthew P. Horning

PDF

Open Access

TL;DR

This paper explores how increasing sample size in medical ML tasks can offset classification errors by reducing Poisson variability, improving clinical performance in tasks like malaria diagnosis.

Contribution

It introduces a mathematical framework linking Poisson error, classification error, and total error, enabling optimization of ML systems through sample size adjustments.

Findings

01

Larger sample sizes can compensate for lower classification accuracy.

02

Poisson variability decreases with increased sample size, improving estimate reliability.

03

Mathematical toolkit aids in designing ML systems for clinical tasks.

Abstract

Medical machine learning algorithms are typically evaluated based on accuracy vs. a clinician-defined ground truth, a reasonable initial choice since trained clinicians are usually better classifiers than ML models. However, this metric does not fully capture the actual clinical task: it neglects the fact that humans, even with perfect accuracy, are subject to non-trivial error from the Poisson statistics of rare events, because clinical protocols often specify a relatively small sample size. For example, to quantitate malaria on a thin blood film a clinician examines only 2000 red blood cells (0.0004 uL), which can yield large Poisson variation in the actual number of parasites present, so that a perfect human's count can differ substantially from the true average load. In contrast, an ML system may be less accurate on an object level, but it may also have the option to examine more…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning in Healthcare