Modeling community standards for metadata as templates makes data FAIR
Mark A. Musen, Martin J. O'Connor, Erik Schultes, Marcos, Martinez-Romero, Josef Hardi, and John Graybeal

TL;DR
This paper proposes a template-based approach for defining community-specific metadata standards to improve the FAIRness of datasets, enabling better data sharing, stewardship, and interoperability across scientific communities.
Contribution
It introduces a system for creating and evaluating machine-actionable, community-specific metadata templates to enhance dataset FAIRness.
Findings
Templates improve metadata consistency and FAIR compliance.
Software systems facilitate template creation and evaluation.
Community standards embedded in templates support data sharing.
Abstract
It is challenging to determine whether datasets are findable, accessible, interoperable, and reusable (FAIR) because the FAIR Guiding Principles refer to highly idiosyncratic criteria regarding the metadata used to annotate datasets. Specifically, the FAIR principles require metadata to be "rich" and to adhere to "domain-relevant" community standards. Scientific communities should be able to define their own machine-actionable templates for metadata that encode these "rich," discipline-specific elements. We have explored this template-based approach in the context of two software systems. One system is the CEDAR Workbench, which investigators use to author new metadata. The other is the FAIRware Workbench, which evaluates the metadata of archived datasets for their adherence to community standards. Benefits accrue when templates for metadata become central elements in an ecosystem of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsScientific Computing and Data Management · Research Data Management Practices · Data Quality and Management
