Efficient and Provable Multi-Query Optimization
Tarun Kathuria, S. Sudarshan

TL;DR
This paper introduces a new greedy algorithm for multi-query optimization that maximizes a linear transformation of the cost function, providing provable approximation guarantees and practical efficiency improvements.
Contribution
It reformulates the MQO problem to enable a greedy algorithm with theoretical approximation guarantees, unlike previous heuristic methods.
Findings
The proposed algorithm offers an approximation factor guarantee.
The algorithm can be integrated into existing optimizers.
Efficiency optimizations improve practical performance.
Abstract
Complex queries for massive data analysis jobs have become increasingly commonplace. Many such queries contain com- mon subexpressions, either within a single query or among multiple queries submitted as a batch. Conventional query optimizers do not exploit these subexpressions and produce sub-optimal plans. The problem of multi-query optimization (MQO) is to generate an optimal combined evaluation plan by computing common subexpressions once and reusing them. Exhaustive algorithms for MQO explore an O(n^n) search space. Thus, this problem has primarily been tackled using various heuristic algorithms, without providing any theoretical guarantees on the quality of their solution. In this paper, instead of the conventional cost minimization problem, we treat the problem as maximizing a linear transformation of the cost function. We propose a greedy algorithm for this transformed…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComplexity and Algorithms in Graphs · Data Management and Algorithms · Optimization and Search Problems
