Improving metadata flows -- The simultaneous use of multiple metadata schemas at disciplinary research data repositories
Dorothea Strecker

TL;DR
This paper examines how research data repositories in geosciences and social sciences use multiple metadata schemas simultaneously, analyzing schema differences, workflows, and potential improvements for metadata quality and expressiveness.
Contribution
It provides an empirical analysis of multi-schema metadata use, highlighting schema differences, workflow diversity, and opportunities for schema extension and optimization.
Findings
DataCite metadata can be improved with better schema crosswalks
Disciplinary metadata schemas are tailored to specific research domains
Metadata workflows are diverse and sometimes suboptimal
Abstract
This study investigates the simultaneous use of multiple metadata schemas at research data repositories. The analysis covers how eight disciplinary research data repositories from the geosciences and social sciences use disciplinary metadata schemas and the DataCite Metadata Schema, and how two metadata records describing the same dataset compare. The results show that DataCite metadata records could be improved considerably by optimizing schema crosswalks. However, the parallel use of disciplinary and multidisciplinary metadata records is complex. For example, discipline has a significant effect on the completeness of DataCite metadata. A temporal analysis also highlights that metadata workflows are diverse, and in some cases, suboptimal crosswalks are likely not the sole cause of incomplete DataCite metadata. Comparing the disciplinary metadata schemas and the DataCite Metadata Schema…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsResearch Data Management Practices · scientometrics and bibliometrics research · Library Science and Information Systems
