Efficient Path Query Processing in Relational Database Systems
Diego Rivera Correa, Mirek Riedewald

TL;DR
This paper introduces ReCAP, a relational optimizer extension that enables early filtering of path queries in relational databases, significantly improving performance over existing graph and relational DBMS.
Contribution
ReCAP provides a simple abstraction allowing relational DBMS to efficiently evaluate complex path queries with property constraints by explicit early filtering.
Findings
ReCAP enables pushing property constraints deep into query plans.
Using ReCAP, DuckDB outperforms state-of-the-art graph and relational DBMS by up to 400,000 times.
ReCAP simplifies implementation by requiring only an NFA-style transition function and a few auxiliary functions.
Abstract
Path queries are crucial for property graphs, and there is growing interest in queries that combine regular expressions over labels with constraints on property values of vertices and edges. Efficient evaluation of such general path queries requires that intermediate results be eliminated early when there is no possible completion to a full result path. Neither state-of-the-art (SOA) graph DBMS nor relational DBMS currently can do this effectively for a large class of queries. We show that this problem can be addressed by giving a relational optimizer ``a little help'' by specifying early filtering opportunities explicitly in the query. To this end, we propose ReCAP, an abstraction that greatly simplifies the implementation of early filtering techniques for any type of property constraint for which such early filtering can be derived. No matter how complex the constraint, one only needs…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
