Designing PIDs for Reproducible Science Using Time-Series Data
Wen Ting Maria Tu, Stephen Makonin

TL;DR
This paper proposes a preliminary method utilizing persistent identifiers (PIDs) to enhance reproducibility in scientific research with time-series data, with potential applicability to other dataset types.
Contribution
It introduces a novel approach for using PIDs to improve reproducibility in scientific research involving time-series data.
Findings
Proposed a PID-based methodology for reproducible research
Demonstrated potential for applying the method to various dataset types
Contributed to standards development in data governance
Abstract
As part of the investigation done by the IEEE Standards Association P2957 Working Group, called Big Data Governance and Metadata Management, the use of persistent identifiers (PIDs) is looked at for tackling the problem of reproducible research and science. This short paper proposes a preliminary method using PIDs to reproduce research results using time-series data. Furthermore, we feel it is possible to use the methodology and design for other types of datasets.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsScientific Computing and Data Management · Research Data Management Practices · Big Data and Business Intelligence
