How permanent are metadata for research data? Understanding changes in DataCite metadata
Dorothea Strecker

TL;DR
This study investigates the stability and change patterns of DataCite metadata records for research data, revealing that most records are stable with incremental updates, supporting their use in scientometric analyses.
Contribution
It provides the first detailed analysis of DataCite metadata provenance, highlighting the prevalence, nature, and implications of metadata changes for research data repositories.
Findings
12.18% of records experienced changes
Metadata changes are mostly incremental and limited
Metadata stability supports scientometric research
Abstract
With the move towards open research information, the DOI registration agency DataCite is increasingly used as a source for metadata describing research data, for example to perform scientometric analyses. However, there is a lack of research on how DataCite metadata describing research data are created and maintained. This paper adresses this gap by using DataCite metadata provenance information to analyze the overall prevalence and patterns of change to DataCite metadata records. Metadata change was observed for 12.18 % of metadata records in the sample, and change tends to be incremental and not extensive. DataCite metadata records offer reliable descriptions of datasets and are stable enough to be used in scientometric research. The rate of change differs from previous studies of metadata change in other contexts, suggesting that there are differences in metadata practices between…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
