Simpli-Squared: A Very Simple Yet Unexpectedly Powerful Join Ordering Algorithm Without Cardinality Estimates
Asoke Datta, Yesdaulet Izenov, Brian Tsan, Florin Rusu

TL;DR
Simpli-Squared is a simple join ordering algorithm that forgoes cardinality estimates, yet achieves performance close to complex methods, questioning the necessity of detailed statistics in query optimization.
Contribution
Introduces Simpli-Squared, a lightweight join ordering method that relies only on schema constraints and table sizes, avoiding complex statistics and correlations.
Findings
Achieves up to 16% performance increase over complex methods
Requires significantly less overhead to implement and maintain
Challenges the importance of cardinality estimates in join optimization
Abstract
The Join Order Benchmark (JOB) has become the de facto standard to assess the performance of relational database query optimizers due to its complexity and completeness. In order to compute the optimal execution plan -- join order -- existing solutions employ extensive data synopses and correlations -- functional dependencies -- between table attributes. These structures incur significant overhead to design, build, and maintain. In this paper, we present \textit{Simpli}city \textit{Simpli}fied (\textit{Simpli-Squared}), a very simple join ordering algorithm that achieves unexpectedly good results. Simpli-Squared computes the join order without using any statistics or cardinality estimates. It takes as input only the referential integrity constraints declared at schema definition and the number of tuples (size) in the base tables. The join order of a given query is computed by splitting…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Database Systems and Queries · Data Management and Algorithms · Data Quality and Management
