Reconciling Binary Replicates: Beyond the Average
H. Lorenzo, P. Pudlo, M. Royer‐Carenzi

TL;DR
This paper examines alternatives to averaging for analyzing repeated binary medical observations and shows that they, the Bayesian approach in particular, can improve diagnostic accuracy.
Contribution
Proposes and evaluates three alternatives to averaging for analyzing binary replicates (the median, maximum penalized likelihood estimation, and a Bayesian algorithm), with emphasis on the Bayesian method, which incorporates uncertainty.
Findings
Bayesian methods outperform averaging in diagnostic accuracy and provide credible intervals.
Simulations and real datasets show practical benefits of the proposed methods.
Incorporating uncertainty improves disease prevalence estimation.
Abstract
Binary observations are often repeated to improve data quality, creating technical replicates. Several scoring methods are commonly used to infer the actual individual state and obtain a probability for each state. The common practice of averaging replicates has limitations, and alternative methods for scoring and classifying individuals are proposed. Additionally, an indecisive response might be wiser than classifying all individuals based on their replicates in the medical context, where 1 indicates a particular health condition. Building on the inherent limitations of the averaging approach, three alternative methods are examined: the median, maximum penalized likelihood estimation, and a Bayesian algorithm. The theoretical analysis suggests that the proposed alternatives outperform the averaging approach, especially the Bayesian method, which incorporates uncertainty and provides…
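To make the contrast between scoring rules concrete, here is a minimal sketch in Python. It is not the authors' algorithm: the posterior score below is a simple stand-in that assumes a known prevalence, sensitivity, and specificity, and the function names and the thresholds 0.25 and 0.75 are hypothetical choices made only for illustration. It shows how an average, a median, and a posterior probability can each score the same replicates, and how an indecisive response can be returned when the score falls between two thresholds.

```python
from statistics import mean, median


def average_score(replicates):
    """Score an individual by the mean of its binary replicates."""
    return mean(replicates)


def median_score(replicates):
    """Score an individual by the median of its binary replicates."""
    return median(replicates)


def posterior_score(replicates, prevalence=0.5, sensitivity=0.9, specificity=0.9):
    """Posterior probability that the true state is 1.

    Assumption (not from the paper): replicates are independent given the
    true state, and prevalence, sensitivity, and specificity are known.
    """
    k = sum(replicates)   # number of replicates equal to 1
    n = len(replicates)   # total number of replicates
    like_pos = sensitivity ** k * (1 - sensitivity) ** (n - k)
    like_neg = (1 - specificity) ** k * specificity ** (n - k)
    numerator = prevalence * like_pos
    return numerator / (numerator + (1 - prevalence) * like_neg)


def classify(score, low=0.25, high=0.75):
    """Return 1, 0, or None (indecisive) from a score in [0, 1].

    The thresholds are illustrative, not values from the paper.
    """
    if score >= high:
        return 1
    if score <= low:
        return 0
    return None


replicates = [1, 1, 0]
avg = average_score(replicates)
med = median_score(replicates)
post = posterior_score(replicates)
print(avg, classify(avg))
print(med, classify(med))
print(post, classify(post))
```

With these illustrative settings, the average of `[1, 1, 0]` lands in the indecisive zone, while the median and the posterior score both classify the individual as 1. The outcome depends entirely on the assumed error rates and thresholds.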
Taxonomy
Topics: Statistical Methods and Inference · Statistical Methods and Bayesian Inference · Machine Learning in Healthcare
