Classification Under Uncertainty: Data Analysis for Diagnostic Antibody Testing
Paul N. Patrone, Anthony J. Kearsley

TL;DR
This paper introduces a novel decision-theoretic approach for classification in diagnostic antibody testing that explicitly accounts for disease prevalence and measurement uncertainty, significantly reducing errors compared to traditional methods.
Contribution
It develops an optimal classification framework that incorporates prevalence and uncertainty, with adaptive and hold-out strategies for unknown prevalence, and extends ROC analysis through optimization.
Findings
Reduces classification errors by up to a decade.
Provides a theoretical foundation linking ROC to optimization.
Demonstrates effectiveness on SARS-CoV-2 serology data.
Abstract
Formulating accurate and robust classification strategies is a key challenge of developing diagnostic and antibody tests. Methods that do not explicitly account for disease prevalence and uncertainty therein can lead to significant classification errors. We present a novel method that leverages optimal decision theory to address this problem. As a preliminary step, we develop an analysis that uses an assumed prevalence and conditional probability models of diagnostic measurement outcomes to define optimal (in the sense of minimizing rates of false positives and false negatives) classification domains. Critically, we demonstrate how this strategy can be generalized to a setting in which the prevalence is unknown by either: (i) defining a third class of hold-out samples that require further testing; or (ii) using an adaptive algorithm to estimate prevalence prior to defining…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
