Efficient Massively Parallel Join Optimization for Large Queries
Riccardo Mancini, Srinivas Karthik, Bikash Chandra, Vasilis, Mageirakos, Anastasia Ailamaki

TL;DR
This paper introduces MPDP, a massively parallel algorithm leveraging modern hardware to significantly speed up join order optimization for large queries, resulting in better plans and faster computation.
Contribution
The paper presents MPDP, a novel parallel join optimization algorithm that efficiently prunes the search space and improves plan quality for large queries.
Findings
MPDP is at least ten times faster than existing methods.
Increased heuristic limit from 12 to 25 relations within the same time budget.
Enhanced heuristics (IDP2 and UnionDP) explore larger search spaces, producing cheaper plans.
Abstract
Modern data analytical workloads often need to run queries over a large number of tables. An optimal query plan for such queries is crucial for being able to run these queries within acceptable time bounds. However, with queries involving many tables, finding the optimal join order becomes a bottleneck in query optimization. Due to the exponential nature of join order optimization, optimizers resort to heuristic solutions after a threshold number of tables. Our objective is two fold: (a) reduce the optimization time for generating optimal plans; and (b) improve the quality of the heuristic solution. In this paper, we propose a new massively parallel algorithm, MPDP, that can efficiently prune the large search space (via a novel plan enumeration technique) while leveraging the massive parallelism offered by modern hardware (Eg: GPUs). When evaluated on real-world benchmark queries with…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Management and Algorithms · Advanced Database Systems and Queries · Graph Theory and Algorithms
