# GSI: GPU-friendly Subgraph Isomorphism

**Authors:** Li Zeng, Lei Zou, M. Tamer \"Ozsu, Lin Hu, Fan Zhang

arXiv: 1906.03420 · 2021-04-21

## TL;DR

This paper introduces GSI, a GPU-optimized subgraph isomorphism algorithm that significantly outperforms existing methods by leveraging GPU features and novel data structures, enabling scalable analysis of large graphs.

## Contribution

The paper presents a novel GPU-friendly subgraph isomorphism algorithm with a vertex-oriented framework and a new data structure, improving scalability and efficiency over prior GPU solutions.

## Key findings

- GSI outperforms state-of-the-art algorithms by up to several orders of magnitude.
- GSI scales efficiently to graphs with hundreds of millions of edges.
- The proposed data structure and strategy reduce redundant join operations.

## Abstract

Subgraph isomorphism is a well-known NP-hard problem that is widely used in many applications, such as social network analysis and query over the knowledge graph. Due to the inherent hardness, its performance is often a bottleneck in various real-world applications. Therefore, we address this by designing an efficient subgraph isomorphism algorithm leveraging features of GPU architecture, such as massive parallelism and memory hierarchy. Existing GPU-based solutions adopt a two-step output scheme, performing the same join process twice in order to write intermediate results concurrently. They also lack GPU architecture-aware optimizations that allow scaling to large graphs. In this paper, we propose a GPU-friendly subgraph isomorphism algorithm, GSI. Different from existing edge join-based GPU solutions, we propose a Prealloc-Combine strategy based on the vertex-oriented framework, which avoids joining-twice in existing solutions. Also, a GPU-friendly data structure (called PCSR) is proposed to represent an edge-labeled graph. Extensive experiments on both synthetic and real graphs show that GSI outperforms the state-of-the-art algorithms by up to several orders of magnitude and has good scalability with graph size scaling to hundreds of millions of edges.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1906.03420/full.md

## Figures

33 figures with captions in the complete paper: https://tomesphere.com/paper/1906.03420/full.md

## References

40 references — full list in the complete paper: https://tomesphere.com/paper/1906.03420/full.md

---
Source: https://tomesphere.com/paper/1906.03420