Building a Disciplinary, World-Wide Data Infrastructure
Fran\c{c}oise Genova, Christophe Arviset, Bridget M. Almas, Laura, Bartolo, Daan Broeder, Emily Law, Brian McMahon

TL;DR
This paper examines how various scientific disciplines organize data sharing infrastructure at the international level, highlighting commonalities, differences, and key success factors for building interoperable, discipline-specific data systems.
Contribution
It provides a comparative analysis of disciplinary data infrastructures across multiple fields, identifying shared principles and challenges in establishing effective data sharing frameworks.
Findings
Data sharing should be driven by scientific needs
Disciplinary standards are essential but difficult to define
Social factors pose greater challenges than technological ones
Abstract
Sharing scientific data, with the objective of making it fully discoverable, accessible, assessable, intelligible, usable, and interoperable, requires work at the disciplinary level to define in particular how the data should be formatted and described. Each discipline has its own organization and history as a starting point, and this paper explores the way a range of disciplines, namely materials science, crystallography, astronomy, earth sciences, humanities and linguistics get organized at the international level to tackle this question. In each case, the disciplinary culture with respect to data sharing, science drivers, organization and lessons learnt are briefly described, as well as the elements of the specific data infrastructure which are or could be shared with others. Commonalities and differences are assessed. Common key elements for success are identified: data sharing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
