Wikidated 1.0: An Evolving Knowledge Graph Dataset of Wikidata's Revision History
Lukas Schmelzeisen, Corina Dima, Steffen Staab

TL;DR
Wikidated 1.0 is a dataset of Wikidata's full revision history that encodes the changes between revisions as sets of RDF triple deletions and additions. To the authors' knowledge, it is the first large dataset of an evolving knowledge graph.
Contribution
The paper introduces Wikidated 1.0, a novel dataset of Wikidata's revision history, and details its generation methodology and statistical properties.
Findings
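To the authors' knowledge, the first large dataset of an evolving knowledge graph.
Statistical characteristics of the dataset, derived from Wikidata's full revision history.
A resource for research on evolving knowledge graphs, a recently emerging subject in the Semantic Web community.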
Abstract
Wikidata is the largest general-interest knowledge base that is openly available. It is collaboratively edited by thousands of volunteer editors and has thus evolved considerably since its inception in 2012. In this paper, we present Wikidated 1.0, a dataset of Wikidata's full revision history, which encodes changes between Wikidata revisions as sets of deletions and additions of RDF triples. To the best of our knowledge, it constitutes the first large dataset of an evolving knowledge graph, a recently emerging research subject in the Semantic Web community. We introduce the methodology for generating Wikidated 1.0 from dumps of Wikidata, discuss its implementation and limitations, and present statistical characteristics of the dataset.
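To make the encoding described in the abstract concrete, the following is a minimal sketch of how the change between two revisions can be expressed as a set of deleted triples and a set of added triples. It is not the authors' implementation: the names `RevisionDiff`, `diff_revisions`, and `apply_diff` are hypothetical, and the example triples are illustrative rather than taken from the dataset.

```python
from typing import Iterable, NamedTuple

# A triple is modeled as (subject, predicate, object) strings.
# This is a simplifying assumption; real RDF terms carry more structure.
Triple = tuple[str, str, str]


class RevisionDiff(NamedTuple):
    """Hypothetical container for the change between two revisions."""
    deletions: frozenset[Triple]
    additions: frozenset[Triple]


def diff_revisions(old: Iterable[Triple], new: Iterable[Triple]) -> RevisionDiff:
    """Compute which triples were deleted and which were added."""
    old_set, new_set = frozenset(old), frozenset(new)
    return RevisionDiff(
        deletions=old_set - new_set,  # present before, absent after
        additions=new_set - old_set,  # absent before, present after
    )


def apply_diff(triples: Iterable[Triple], diff: RevisionDiff) -> frozenset[Triple]:
    """Reconstruct the next revision from the previous one and a diff."""
    return (frozenset(triples) - diff.deletions) | diff.additions


# Illustrative triples only (not taken from the dataset).
revision_1 = {
    ("wd:Q42", "wdt:P31", "wd:Q5"),
    ("wd:Q42", "rdfs:label", '"Douglas Adam"@en'),
}
revision_2 = {
    ("wd:Q42", "wdt:P31", "wd:Q5"),
    ("wd:Q42", "rdfs:label", '"Douglas Adams"@en'),
}

diff = diff_revisions(revision_1, revision_2)
reconstructed = apply_diff(revision_1, diff)
```

In this sketch, a corrected label appears as one deletion plus one addition, and applying the diffs in order to an empty set of triples replays an entity's history revision by revision.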
Taxonomy
Topics: Advanced Graph Neural Networks · Topic Modeling · Natural Language Processing Techniques
Methods: Balanced Selection
