Taming Subgraph Isomorphism for RDF Query Processing
Jinha Kim (1, 2), Hyungyu Shin (1), Wook-Shin Han (1), Sungpack, Hong (2), and Hassan Chafi (2) ((1) POSTECH, South Korea, (2) Oracle Labs,, USA)

TL;DR
This paper introduces TurboHOM++, an optimized in-memory subgraph isomorphism algorithm tailored for RDF query processing, significantly outperforming existing RDF engines by up to five orders of magnitude.
Contribution
It adapts a state-of-the-art subgraph isomorphism algorithm for RDF data, incorporating transformations and optimizations to efficiently handle RDF pattern matching semantics.
Findings
TurboHOM++ outperforms existing RDF engines by up to five orders of magnitude.
The proposed approach effectively handles large-scale RDF data in memory.
Extensive experiments validate the efficiency and scalability of TurboHOM++.
Abstract
RDF data are used to model knowledge in various areas such as life sciences, Semantic Web, bioinformatics, and social graphs. The size of real RDF data reaches billions of triples. This calls for a framework for efficiently processing RDF data. The core function of processing RDF data is subgraph pattern matching. There have been two completely different directions for supporting efficient subgraph pattern matching. One direction is to develop specialized RDF query processing engines exploiting the properties of RDF data for the last decade, while the other direction is to develop efficient subgraph isomorphism algorithms for general, labeled graphs for over 30 years. Although both directions have a similar goal (i.e., finding subgraphs in data graphs for a given query graph), they have been independently researched without clear reason. We argue that a subgraph isomorphism algorithm…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
