Taming Subgraph Isomorphism for RDF Query Processing

Jinha Kim (1; 2); Hyungyu Shin (1); Wook-Shin Han (1); Sungpack; Hong (2); and Hassan Chafi (2) ((1) POSTECH; South Korea; (2) Oracle Labs,; USA)

arXiv:1506.01973·cs.DB·June 11, 2015

Taming Subgraph Isomorphism for RDF Query Processing

Jinha Kim (1, 2), Hyungyu Shin (1), Wook-Shin Han (1), Sungpack, Hong (2), and Hassan Chafi (2) ((1) POSTECH, South Korea, (2) Oracle Labs,, USA)

PDF

TL;DR

This paper introduces TurboHOM++, an optimized in-memory subgraph isomorphism algorithm tailored for RDF query processing, significantly outperforming existing RDF engines by up to five orders of magnitude.

Contribution

It adapts a state-of-the-art subgraph isomorphism algorithm for RDF data, incorporating transformations and optimizations to efficiently handle RDF pattern matching semantics.

Findings

01

TurboHOM++ outperforms existing RDF engines by up to five orders of magnitude.

02

The proposed approach effectively handles large-scale RDF data in memory.

03

Extensive experiments validate the efficiency and scalability of TurboHOM++.

Abstract

RDF data are used to model knowledge in various areas such as life sciences, Semantic Web, bioinformatics, and social graphs. The size of real RDF data reaches billions of triples. This calls for a framework for efficiently processing RDF data. The core function of processing RDF data is subgraph pattern matching. There have been two completely different directions for supporting efficient subgraph pattern matching. One direction is to develop specialized RDF query processing engines exploiting the properties of RDF data for the last decade, while the other direction is to develop efficient subgraph isomorphism algorithms for general, labeled graphs for over 30 years. Although both directions have a similar goal (i.e., finding subgraphs in data graphs for a given query graph), they have been independently researched without clear reason. We argue that a subgraph isomorphism algorithm…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.