Classification Under Uncertainty: Data Analysis for Diagnostic Antibody   Testing

Paul N. Patrone; Anthony J. Kearsley

arXiv:2012.10021·stat.ME·February 1, 2022

Classification Under Uncertainty: Data Analysis for Diagnostic Antibody Testing

Paul N. Patrone, Anthony J. Kearsley

PDF

TL;DR

This paper introduces a novel decision-theoretic approach for classification in diagnostic antibody testing that explicitly accounts for disease prevalence and measurement uncertainty, significantly reducing errors compared to traditional methods.

Contribution

It develops an optimal classification framework that incorporates prevalence and uncertainty, with adaptive and hold-out strategies for unknown prevalence, and extends ROC analysis through optimization.

Findings

01

Reduces classification errors by up to a decade.

02

Provides a theoretical foundation linking ROC to optimization.

03

Demonstrates effectiveness on SARS-CoV-2 serology data.

Abstract

Formulating accurate and robust classification strategies is a key challenge of developing diagnostic and antibody tests. Methods that do not explicitly account for disease prevalence and uncertainty therein can lead to significant classification errors. We present a novel method that leverages optimal decision theory to address this problem. As a preliminary step, we develop an analysis that uses an assumed prevalence and conditional probability models of diagnostic measurement outcomes to define optimal (in the sense of minimizing rates of false positives and false negatives) classification domains. Critically, we demonstrate how this strategy can be generalized to a setting in which the prevalence is unknown by either: (i) defining a third class of hold-out samples that require further testing; or (ii) using an adaptive algorithm to estimate prevalence prior to defining…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.