How Stable is Knowledge Base Knowledge?
Suhas Shrinivasan, Simon Razniewski

TL;DR
This paper investigates the stability of knowledge bases by analyzing real-world changes, proposing heuristics to distinguish genuine world-driven updates from other data modifications, and demonstrating the potential to predict stability with high accuracy.
Contribution
It introduces heuristics to differentiate real-world changes from data corrections and delays, and evaluates the predictability of KB stability with up to 83% F1 score.
Findings
High skew in property change behavior across Wikidata domains
Heuristics effectively identify entities and properties unlikely to change
Stability of property changes can be predicted with up to 83% F1 score
Abstract
Knowledge Bases (KBs) provide structured representation of the real-world in the form of extensive collections of facts about real-world entities, their properties and relationships. They are ubiquitous in large-scale intelligent systems that exploit structured information such as in tasks like structured search, question answering and reasoning, and hence their data quality becomes paramount. The inevitability of change in the real-world, brings us to a central property of KBs -- they are highly dynamic in that the information they contain are constantly subject to change. In other words, KBs are unstable. In this paper, we investigate the notion of KB stability, specifically, the problem of KBs changing due to real-world change. Some entity-property-pairs do not undergo change in reality anymore (e.g., Einstein-children or Tesla-founders), while others might well change in the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Quality and Management · Data Stream Mining Techniques · Semantic Web and Ontologies
