Pre-registration for Predictive Modeling
Jake M. Hofman, Angelos Chatzimparmpas, Amit Sharma, Duncan J. Watts,, Jessica Hullman

TL;DR
This paper investigates the adaptation of pre-registration practices from explanatory to predictive modeling to enhance reproducibility, reduce bias, and improve reliability in machine learning research.
Contribution
It introduces a lightweight pre-registration template for predictive modeling and provides qualitative insights from researchers on its potential benefits and limitations.
Findings
Pre-registration can help prevent biased estimates.
Researchers see value in pre-registration for improving reliability.
Pre-registration has limitations in addressing all predictive modeling challenges.
Abstract
Amid rising concerns of reproducibility and generalizability in predictive modeling, we explore the possibility and potential benefits of introducing pre-registration to the field. Despite notable advancements in predictive modeling, spanning core machine learning tasks to various scientific applications, challenges such as overlooked contextual factors, data-dependent decision-making, and unintentional re-use of test data have raised questions about the integrity of results. To address these issues, we propose adapting pre-registration practices from explanatory modeling to predictive modeling. We discuss current best practices in predictive modeling and their limitations, introduce a lightweight pre-registration template, and present a qualitative study with machine learning researchers to gain insight into the effectiveness of pre-registration in preventing biased estimates and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Machine Learning in Healthcare · Machine Learning in Materials Science
