A Statistical Framework for Measuring Reproducibility and Replicability of High‐Throughput Experiments From Multiple Sources
Monia Ranalli, Yafei Lyu, Hillary Koch, Qunhua Li

TL;DR
This paper introduces a statistical model to measure how reproducible and replicable results are across multiple high-throughput experiments.
Contribution
A novel statistical framework using a nested copula mixture model to assess reproducibility and replicability across multi-source experiments.
Findings
The model effectively identifies sources of irreproducibility in high-throughput data.
Simulation and real datasets show improved reliability of scientific discoveries using the framework.
Abstract
Replication is essential to reliable and consistent scientific discovery in high‐throughput experiments. Quantifying the replicability of scientific discoveries and identifying sources of irreproducibility have become important tasks for quality control and data integration. In this work we introduce a novel statistical model to measure the reproducibility and replicability of findings from replicate experiments in multi‐source studies. Using a nested copula mixture model that characterizes the interdependence between replication experiments both across and within sources, our method quantifies reproducibility and replicability of each candidate simultaneously in a coherent framework. Through simulation studies, an ENCODE ChIP‐seq dataset and a SEQC RNA‐seq dataset, we demonstrate the effectiveness of our method in diagnosing the source of discordance and improving the reliability of…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5
Figure 6
Figure 7Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCell Image Analysis Techniques · Scientific Computing and Data Management · Genomics and Phylogenetic Studies
