Enabling Fine-grained RDF Data Completeness Assessment
Fariz Darari, Simon Razniewski, Radityo Eko Prasojo, Werner Nutt

TL;DR
This paper presents a technique for assessing the completeness of RDF data, especially in large, crowdsourced datasets like Wikidata, by developing scalable reasoning methods and a practical tool.
Contribution
It introduces a scalable reasoning framework for RDF completeness, including a tool for Wikidata, addressing the challenge of data completeness assessment at web scale.
Findings
Developed a completeness reasoning technique for RDF data.
Created COOL-WD, a tool for annotating and reasoning about Wikidata completeness.
Validated the approach on Wikidata, demonstrating practical scalability.
Abstract
Nowadays, more and more RDF data is becoming available on the Semantic Web. While the Semantic Web is generally incomplete by nature, on certain topics, it already contains complete information and thus, queries may return all answers that exist in reality. In this paper we develop a technique to check query completeness based on RDF data annotated with completeness information, taking into account data-specific inferences that lead to an inference problem which is -complete. We then identify a practically relevant fragment of completeness information, suitable for crowdsourced, entity-centric RDF data sources such as Wikidata, for which we develop an indexing technique that allows to scale completeness reasoning to Wikidata-scale data sources. We verify the applicability of our framework using Wikidata and develop COOL-WD, a completeness tool for Wikidata, used to annotate…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSemantic Web and Ontologies · Data Quality and Management · Service-Oriented Architecture and Web Services
