A Study of Scientific Computational Notebook Quality
Shun Kashiwa, Ayla Kurdak, Savitha Ravi, Ridhi Srikanth, Angel Thakur, Sonia Chandra, Jonathan Truong, and Michael Coblenz

TL;DR
This paper evaluates scientific code quality in research notebooks, revealing reproducibility issues, code duplication, and tangled state changes, highlighting the need for better tools to improve scientific software reliability.
Contribution
It provides a comprehensive analysis of scientific code quality across reproducibility, readability, and reusability, based on a large corpus of research notebooks and multiple assessment methods.
Findings
Only 2 of 19 notebooks were reproducible
Widespread code duplication among notebooks
Scientific notebooks often have tangled state changes
Abstract
The quality of scientific code is a critical concern for the research community. Poorly written code can result in irreproducible results, incorrect findings, and slower scientific progress. In this study, we evaluate scientific code quality across three dimensions: reproducibility, readability, and reusability. We curated a corpus of 518 code repositories by analyzing Code Availability statements from all 1239 Nature publications in 2024. To assess code quality, we employed multiple methods, including manual attempts to reproduce Jupyter notebooks, documentation reviews, and analyses of code clones and mutation patterns. Our results reveal major challenges in scientific code quality. Of the 19 notebooks we attempted to execute, only two were reproducible, primarily due to missing data files and dependency issues. Code duplication was also common, with 326 clone classes of at least 10…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsScientific Computing and Data Management · Software Engineering Research · Research Data Management Practices
