Creating and Querying Personalized Versions of Wikidata on a Laptop
Hans Chalupsky, Pedro Szekely, Filip Ilievski, Daniel Garijo and, Kartik Shenoy

TL;DR
This paper introduces KGTK Kypher, a new query language and processor that enables users to create personalized, full-scale Wikidata variants on a laptop, supporting complex queries faster than traditional methods.
Contribution
The paper presents KGTK Kypher, a novel query system that allows efficient, personalized querying of the entire Wikidata knowledge graph on a laptop.
Findings
Kypher queries run significantly faster than SPARQL on large datasets.
Users can perform complex analyses on full Wikidata locally.
Kypher enables integration of external data sources like DBpedia.
Abstract
Application developers today have three choices for exploiting the knowledge present in Wikidata: they can download the Wikidata dumps in JSON or RDF format, they can use the Wikidata API to get data about individual entities, or they can use the Wikidata SPARQL endpoint. None of these methods can support complex, yet common, query use cases, such as retrieval of large amounts of data or aggregations over large fractions of Wikidata. This paper introduces KGTK Kypher, a query language and processor that allows users to create personalized variants of Wikidata on a laptop. We present several use cases that illustrate the types of analyses that Kypher enables users to run on the full Wikidata KG on a laptop, combining data from external resources such as DBpedia. The Kypher queries for these use cases run much faster on a laptop than the equivalent SPARQL queries on a Wikidata clone…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSemantic Web and Ontologies · Natural Language Processing Techniques · Web Data Mining and Analysis
