Probably Approximately Optimal Query Optimization
Immanuel Trummer, Christoph Koch

TL;DR
This paper introduces a new probabilistic approach to query optimization called PAO, which finds near-optimal query plans with high confidence using iterative sampling and standard query optimizers.
Contribution
It presents the first algorithm for probably approximately optimal query optimization that is generic, iterative, and integrates sampling with existing query optimization tools.
Findings
The algorithm effectively balances sampling and optimization to find near-optimal plans.
Experimental results show reduced number of samples and optimizer calls.
Different algorithm variants offer trade-offs in complexity and efficiency.
Abstract
Evaluating query predicates on data samples is the only way to estimate their selectivity in certain scenarios. Finding a guaranteed optimal query plan is not a reasonable optimization goal in those cases as it might require an infinite number of samples. We therefore introduce probably approximately optimal query optimization (PAO) where the goal is to find a query plan whose cost is near-optimal with a certain probability. We will justify why PAO is a suitable formalism to model scenarios in which predicate sampling and optimization need to be interleaved. We present the first algorithm for PAO. Our algorithm is non-intrusive and uses standard query optimizers and sampling components as sub-functions. It is generic and can be applied to a wide range of scenarios. Our algorithm is iterative and calculates in each iteration a query plan together with a region in the selectivity space…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Management and Algorithms · Mobile Crowdsensing and Crowdsourcing · Machine Learning and Algorithms
