Code Sharing In Prediction Model Research: A Scoping Review

Thomas Sounack; Raffaele Giancotti; Catherine A. Gao; Lasai Barre\~nada; Hyeonhoon Lee; Hyung-Chul Lee; Leo Anthony Celi; Karel G.M. Moons; Gary S. Collins; Charlotta Lindvall; Tom Pollard

arXiv:2604.06212·cs.SE·April 9, 2026

Code Sharing In Prediction Model Research: A Scoping Review

Thomas Sounack, Raffaele Giancotti, Catherine A. Gao, Lasai Barre\~nada, Hyeonhoon Lee, Hyung-Chul Lee, Leo Anthony Celi, Karel G.M. Moons, Gary S. Collins, Charlotta Lindvall, Tom Pollard

PDF

TL;DR

This scoping review examines current code sharing practices in prediction model research, highlighting limited sharing rates and variability in repository quality to inform future reporting guidelines.

Contribution

It provides an empirical baseline on code sharing prevalence and repository quality, informing the development of the TRIPOD-Code reporting extension.

Findings

01

12.2% of articles shared code in 2025

02

Code sharing increased over time, reaching 15.8% in 2025

03

Repositories often lacked detailed dependencies and modularity

Abstract

Analytical code is essential for reproducing diagnostic and prognostic prediction model research, yet code availability in the published literature remains limited. While the TRIPOD statements set standards for reporting prediction model methods, they do not define explicit standards for repository structure and documentation. This review quantifies current code-sharing practices to inform the development of TRIPOD-Code, a TRIPOD extension reporting guideline focused on code sharing. We conducted a scoping review of PubMed-indexed articles citing TRIPOD or TRIPOD+AI as of Aug 11, 2025, restricted to studies retrievable via the PubMed Central Open Access API. Eligible studies developed, updated, or validated multivariable prediction models. A large language model-assisted pipeline was developed to screen articles and extract code availability statements and repository links.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.