Reproducible Research: A Retrospective
Roger D. Peng, Stephanie C. Hicks

TL;DR
This paper reviews the importance of reproducibility and replicability in scientific research, especially in public health, highlighting challenges, current status, and proposing future improvements for reliable scientific progress.
Contribution
It provides a comprehensive retrospective analysis of reproducible research, clarifies its relationship with replicability, and suggests strategies to enhance scientific reliability.
Findings
Reproducibility is often hindered by lack of data and code availability.
Reproducibility and replicability serve different but related scientific goals.
Improving transparency can enhance both reproducibility and replicability.
Abstract
Rapid advances in computing technology over the past few decades have spurred two extraordinary phenomena in science: large-scale and high-throughput data collection coupled with the creation and implementation of complex statistical algorithms for data analysis. Together, these two phenomena have brought about tremendous advances in scientific discovery but have also raised two serious concerns, one relatively new and one quite familiar. The complexity of modern data analyses raises questions about the reproducibility of the analyses, meaning the ability of independent analysts to re-create the results claimed by the original authors using the original data and analysis techniques. While seemingly a straightforward concept, reproducibility of analyses is typically thwarted by the lack of availability of the data and computer code that were used in the analyses. A much more general…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
