From Code to Figure: A FAIR-Aligned Data Provenance Chain for Reproducible Simulation Research in Numerical Physics
Markus Uehlein, Tobias Held, Christopher Seibel, Lukas G. Jonda, Baerbel Rethfeld, and Sebastian T. Weber

TL;DR
This paper introduces an integrated workflow that enhances reproducibility and FAIR data principles in numerical physics simulations by linking code, data, and outputs through structured practices.
Contribution
It presents a comprehensive approach combining version control, testing, logging, and metadata to improve traceability in simulation research.
Findings
Workflow supports reproducibility in numerical physics simulations.
Concepts are broadly applicable beyond the specific framework.
Enhances traceability from code to publication.
Abstract
Computational physics increasingly depends on large simulation datasets generated by software that remains under active development for many years. In such settings, reproducibility requires not only well documented data but also explicit links between code versions, simulation inputs, generated outputs, analysis steps, and published figures. Here, we present an integrated workflow for reproducible and FAIR-aligned simulation research in numerical physics. We describe how version control, code review, automated testing, structured logging, metadata-rich output, and standardized post-processing can be combined to support traceability from software development to publication. The presented concepts demonstrated for one particular simulation framework are broadly applicable to computational physics and other data-intensive areas of scientific computing.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
