Designing PIDs for Reproducible Science Using Time-Series Data

Wen Ting Maria Tu; Stephen Makonin

arXiv:2209.10475 · cs.DB · September 22, 2022

Open Access

TL;DR

This paper proposes a preliminary method utilizing persistent identifiers (PIDs) to enhance reproducibility in scientific research with time-series data, with potential applicability to other dataset types.

Contribution

It introduces a preliminary approach that uses PIDs to make results from research on time-series data reproducible.

Findings

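01 Proposes a preliminary PID-based method for reproducing research results that use time-series data

02 Suggests that the methodology and design may extend to other types of datasets

03 Reports work done as part of the IEEE Standards Association P2957 Working Group investigation into big data governance and metadata management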

Abstract

As part of the investigation by the IEEE Standards Association P2957 Working Group, Big Data Governance and Metadata Management, persistent identifiers (PIDs) are examined as a way to address the problem of reproducible research and science. This short paper proposes a preliminary method that uses PIDs to reproduce research results based on time-series data. We also believe the methodology and design could be applied to other types of datasets.

Taxonomy

Topics: Scientific Computing and Data Management · Research Data Management Practices · Big Data and Business Intelligence