Partition Constraints for Conjunctive Queries: Bounds and Worst-Case Optimal Joins
Kyle Deeds, Timo Camillo Merkl

TL;DR
This paper introduces partition constraints as a new statistical tool to better bound conjunctive query output sizes and enhance worst-case optimal join algorithms by capturing latent relation structures.
Contribution
It proposes partition constraints to refine bounds and improve join algorithms, extending the use of relation statistics in query optimization.
Findings
Partition constraints tighten size bounds of query outputs.
Partition constraints lead to more efficient join algorithms.
The approach captures latent relation structures for better query planning.
Abstract
In the last decade, various works have used statistics on relations to improve both the theory and practice of conjunctive query execution. Starting with the AGM bound which took advantage of relation sizes, later works incorporated statistics like functional dependencies and degree constraints. Each new statistic prompted work along two lines; bounding the size of conjunctive query outputs and worst-case optimal join algorithms. In this work, we continue in this vein by introducing a new statistic called a \emph{partition constraint}. This statistic captures latent structure within relations by partitioning them into sub-relations which each have much tighter degree constraints. We show that this approach can both refine existing cardinality bounds and improve existing worst-case optimal join algorithms.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
