Prediction scoring of data-driven discoveries for reproducible research
Anna L. Smith, Tian Zheng, and Andrew Gelman

TL;DR
This paper introduces a prediction scoring method based on cross-validation to compare data-generating mechanisms across studies, aiding reproducibility and understanding of complex models in behavioral research.
Contribution
It formalizes a distance measure between DGMs using prediction scores, enabling quantitative comparison of studies and assessment of model agreement.
Findings
Prediction scores effectively quantify differences between DGMs.
Cross-validated scores can evaluate hypotheses and compare populations.
Simulations show scores reveal important model differences.
Abstract
Predictive modeling uncovers knowledge and insights regarding a hypothesized data generating mechanism (DGM). Results from different studies on a complex DGM, derived from different data sets, and using complicated models and algorithms, are hard to quantitatively compare due to random noise and statistical uncertainty in model results. This has been one of the main contributors to the replication crisis in the behavioral sciences. The contribution of this paper is to apply prediction scoring to the problem of comparing two studies, such as can arise when evaluating replications or competing evidence. We examine the role of predictive models in quantitatively assessing agreement between two datasets that are assumed to come from two distinct DGMs. We formalize a distance between the DGMs that is estimated using cross validation. We argue that the resulting prediction scores depend…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsScientific Computing and Data Management · Data Analysis with R
