Code Sharing In Prediction Model Research: A Scoping Review
Thomas Sounack, Raffaele Giancotti, Catherine A. Gao, Lasai Barre\~nada, Hyeonhoon Lee, Hyung-Chul Lee, Leo Anthony Celi, Karel G.M. Moons, Gary S. Collins, Charlotta Lindvall, Tom Pollard

TL;DR
This scoping review examines current code sharing practices in prediction model research, highlighting limited sharing rates and variability in repository quality to inform future reporting guidelines.
Contribution
It provides an empirical baseline on code sharing prevalence and repository quality, informing the development of the TRIPOD-Code reporting extension.
Findings
12.2% of articles shared code in 2025
Code sharing increased over time, reaching 15.8% in 2025
Repositories often lacked detailed dependencies and modularity
Abstract
Analytical code is essential for reproducing diagnostic and prognostic prediction model research, yet code availability in the published literature remains limited. While the TRIPOD statements set standards for reporting prediction model methods, they do not define explicit standards for repository structure and documentation. This review quantifies current code-sharing practices to inform the development of TRIPOD-Code, a TRIPOD extension reporting guideline focused on code sharing. We conducted a scoping review of PubMed-indexed articles citing TRIPOD or TRIPOD+AI as of Aug 11, 2025, restricted to studies retrievable via the PubMed Central Open Access API. Eligible studies developed, updated, or validated multivariable prediction models. A large language model-assisted pipeline was developed to screen articles and extract code availability statements and repository links.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
