A General Cardinality Estimation Framework for Subgraph Matching in Property Graphs
Wilco van Leeuwen, George Fletcher, Nikolay Yakovets

TL;DR
This paper introduces a comprehensive framework for estimating the number of results in subgraph matching queries over property graphs, enabling analysis, comparison, and combination of various estimation techniques.
Contribution
It proposes a three-phase, modular framework that unifies existing approaches and facilitates the development of new techniques for accurate cardinality estimation in property graph databases.
Findings
Existing approaches can be integrated into the framework's phases.
Accurate estimates for small patterns without property constraints.
Need for new techniques to handle property constraints effectively.
Abstract
Many techniques have been developed for the cardinality estimation problem in data management systems. In this document, we introduce a framework for cardinality estimation of query patterns over property graph databases, which makes it possible to analyze, compare and combine different cardinality estimation approaches. This framework consists of three phases: obtaining a set of estimates for some subqueries, extending this set and finally combining the set into a single cardinality estimate for the query. We show that (parts of) many of the existing cardinality estimation approaches can be used as techniques in one of the phases from our framework. The three phases are loosely coupled, this makes it possible to combine (parts of) current cardinality estimation approaches. We create a graph version of the Join Order Benchmark to perform experiments with different combinations of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGraph Theory and Algorithms · Advanced Database Systems and Queries · Data Management and Algorithms
