Towards a Rosetta Stone for (meta)data: Learning from natural language to improve semantic and cognitive interoperability
Lars Vogt, Marcel Konrad, Manuel Prinz

TL;DR
This paper introduces a machine-actionable Rosetta Stone Framework that leverages natural language insights to improve semantic and cognitive interoperability of (meta)data through reference terms, schemata, and user-friendly tools.
Contribution
It proposes a novel Rosetta modeling paradigm and editor that enable domain experts to create interoperable schemata without deep semantic knowledge.
Findings
Framework minimizes mappings and crosswalks between data schemas.
Modeling statements improves interoperability and cognitive familiarity.
Tools support human-readable and machine-actionable data representations.
Abstract
In order to effectively manage the overwhelming influx of data, it is crucial to ensure that data is findable, accessible, interoperable, and reusable (FAIR). While ontologies and knowledge graphs have been employed to enhance FAIRness, challenges remain regarding semantic and cognitive interoperability. We explore how English facilitates reliable communication of terms and statements, and transfer our findings to a framework of ontologies and knowledge graphs, while treating terms and statements as minimal information units. We categorize statement types based on their predicates, recognizing the limitations of modeling non-binary predicates with multiple triples, which negatively impacts interoperability. Terms are associated with different frames of reference, and different operations require different schemata. Term mappings and schema crosswalks are therefore vital for semantic…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSemantic Web and Ontologies · Research Data Management Practices · Scientific Computing and Data Management
