Relational Approach for Shortest Path Discovery over Large Graphs
Jun Gao, Ruoming Jin, Jiashuai Zhou, Jeffrey Xu Yu, Xiao Jiang,, Tengjiao Wang

TL;DR
This paper presents a relational database framework for efficient shortest path discovery in large graphs, utilizing iterative operations, SQL features, and optimization strategies like bi-directional search and SegTable indexing.
Contribution
It introduces a novel FEM framework with SQL-based operators for graph search, enhanced by new optimization strategies tailored for large-scale graph shortest path discovery.
Findings
Achieves high scalability and performance in large graph shortest path tasks.
Utilizes SQL features like window functions and merge statements to simplify and optimize graph search.
Bi-directional Dijkstra and SegTable indexing significantly reduce search space and improve efficiency.
Abstract
With the rapid growth of large graphs, we cannot assume that graphs can still be fully loaded into memory, thus the disk-based graph operation is inevitable. In this paper, we take the shortest path discovery as an example to investigate the technique issues when leveraging existing infrastructure of relational database (RDB) in the graph data management. Based on the observation that a variety of graph search queries can be implemented by iterative operations including selecting frontier nodes from visited nodes, making expansion from the selected frontier nodes, and merging the expanded nodes into the visited ones, we introduce a relational FEM framework with three corresponding operators to implement graph search tasks in the RDB context. We show new features such as window function and merge statement introduced by recent SQL standards can not only simplify the expression but also…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Management and Algorithms · Advanced Database Systems and Queries · Data Mining Algorithms and Applications
