Reconciling Binary Replicates: Beyond the Average

Manuela Royer-Carenzi; Hadrien Lorenzo; Pierre Pudlo

arXiv:2501.13745·stat.ME·January 24, 2025

Reconciling Binary Replicates: Beyond the Average

Manuela Royer-Carenzi, Hadrien Lorenzo, Pierre Pudlo

PDF

TL;DR

This paper explores alternative scoring methods for binary replicate data, proposing median, penalized likelihood, and Bayesian approaches that outperform simple averaging, especially in medical diagnostics.

Contribution

It introduces and compares three novel methods for binary replicate analysis, addressing limitations of averaging and enhancing diagnostic accuracy.

Findings

01

Bayesian method outperforms averaging in simulations

02

Proposed methods improve disease prevalence estimation

03

Bayesian approach provides credible intervals for uncertainty

Abstract

Binary observations are often repeated to improve data quality, creating technical replicates. Several scoring methods are commonly used to infer the actual individual state and obtain a probability for each state. The common practice of averaging replicates has limitations, and alternative methods for scoring and classifying individuals are proposed. Additionally, an indecisive response might be wiser than classifying all individuals based on their replicates in the medical context, where 1 indicates a particular health condition. Building on the inherent limitations of the averaging approach, three alternative methods are examined: the median, maximum penalized likelihood estimation, and a Bayesian algorithm. The theoretical analysis suggests that the proposed alternatives outperform the averaging approach, especially the Bayesian method, which incorporates uncertainty and provides…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.