Reliable Granular References to Changing Linked Data
Tobias Kuhn, Egon Willighagen, Chris Evelo, N\'uria Queralt-Rosinach,, Emilio Centeno, Laura I. Furlong

TL;DR
This paper improves the efficiency and reliability of referencing evolving nanopublication datasets by leveraging version history and incremental updates, reducing overhead and enabling precise subset retrieval.
Contribution
It introduces a method to manage nanopublication datasets efficiently over time by utilizing version history and incremental updates, enhancing reference reliability.
Findings
Overhead decreases when using version history and incremental updates.
Efficient referencing of dataset subsets is achievable with optimized precision.
Total size of datasets is reduced with the proposed approach.
Abstract
Nanopublications are a concept to represent Linked Data in a granular and provenance-aware manner, which has been successfully applied to a number of scientific datasets. We demonstrated in previous work how we can establish reliable and verifiable identifiers for nanopublications and sets thereof. Further adoption of these techniques, however, was probably hindered by the fact that nanopublications can lead to an explosion in the number of triples due to auxiliary information about the structure of each nanopublication and repetitive provenance and metadata. We demonstrate here that this significant overhead disappears once we take the version history of nanopublication datasets into account, calculate incremental updates, and allow users to deal with the specific subsets they need. We show that the total size and overhead of evolving scientific datasets is reduced, and typical subsets…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSemantic Web and Ontologies · Biomedical Text Mining and Ontologies · Library Science and Information Systems
