KGTK: A Toolkit for Large Knowledge Graph Manipulation and Analysis

Filip Ilievski; Daniel Garijo; Hans Chalupsky; Naren Teja; Divvala; Yixiang Yao; Craig Rogers; Rongpeng Li; Jun Liu and; Amandeep Singh; Daniel Schwabe; Pedro Szekely

arXiv:2006.00088·cs.AI·May 27, 2021

KGTK: A Toolkit for Large Knowledge Graph Manipulation and Analysis

Filip Ilievski, Daniel Garijo, Hans Chalupsky, Naren Teja, Divvala, Yixiang Yao, Craig Rogers, Rongpeng Li, Jun Liu and, Amandeep Singh, Daniel Schwabe, Pedro Szekely

PDF

1 Repo

TL;DR

KGTK is a comprehensive, data science-oriented toolkit that simplifies the manipulation, transformation, and analysis of large knowledge graphs like Wikidata and DBpedia by leveraging table-based representations and popular data science libraries.

Contribution

The paper introduces KGTK, a unified toolkit that addresses the heterogeneity and complexity of existing KG tools by providing a table-based, scalable, and user-friendly framework for KG operations.

Findings

01

Successfully integrated large KGs like Wikidata and DBpedia.

02

Enabled complex KG transformations using familiar data science libraries.

03

Demonstrated practical applications in real-world scenarios.

Abstract

Knowledge graphs (KGs) have become the preferred technology for representing, sharing and adding knowledge to modern AI applications. While KGs have become a mainstream technology, the RDF/SPARQL-centric toolset for operating with them at scale is heterogeneous, difficult to integrate and only covers a subset of the operations that are commonly needed in data science applications. In this paper we present KGTK, a data science-centric toolkit designed to represent, create, transform, enhance and analyze KGs. KGTK represents graphs in tables and leverages popular libraries developed for data science applications, enabling a wide audience of developers to easily construct knowledge graph pipelines for their applications. We illustrate the framework with real-world scenarios where we have used KGTK to integrate and manipulate large KGs, such as Wikidata, DBpedia and ConceptNet.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

usc-isi-i2/kgtk
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.