Glue: Adaptively Merging Single Table Cardinality to Estimate Join Query Size
Rong Zhu, Tianjing Zeng, Andreas Pfadler, Wei Chen, Bolin Ding,, Jingren Zhou

TL;DR
This paper introduces Glue, a flexible framework that combines single-table cardinality estimates to accurately and efficiently estimate join query sizes across complex schemas, adaptable to various performance needs.
Contribution
Glue provides a general, decoupled approach to merge single-table cardinality estimates for join size estimation, supporting any existing method and complex schemas.
Findings
Supports any existing CardEst method for single tables
Adapts to different scenarios like OLTP and OLAP
Seamlessly integrates into plan search and supports distinct count estimation
Abstract
Cardinality estimation (CardEst), a central component of the query optimizer, plays a significant role in generating high-quality query plans in DBMS. The CardEst problem has been extensively studied in the last several decades, using both traditional and ML-enhanced methods. Whereas, the hardest problem in CardEst, i.e., how to estimate the join query size on multiple tables, has not been extensively solved. Current methods either reply on independence assumptions or apply techniques with heavy burden, whose performance is still far from satisfactory. Even worse, existing CardEst methods are often designed to optimize one goal, i.e., inference speed or estimation accuracy, which can not adapt to different occasions. In this paper, we propose a very general framework, called Glue, to tackle with these challenges. Its key idea is to elegantly decouple the correlations across different…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Database Systems and Queries · Data Quality and Management · Data Management and Algorithms
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
