Efficient and Provable Multi-Query Optimization

Tarun Kathuria; S. Sudarshan

arXiv:1512.02568·cs.DB·January 20, 2017

Efficient and Provable Multi-Query Optimization

Tarun Kathuria, S. Sudarshan

PDF

Open Access

TL;DR

This paper introduces a new greedy algorithm for multi-query optimization that maximizes a linear transformation of the cost function, providing provable approximation guarantees and practical efficiency improvements.

Contribution

It reformulates the MQO problem to enable a greedy algorithm with theoretical approximation guarantees, unlike previous heuristic methods.

Findings

01

The proposed algorithm offers an approximation factor guarantee.

02

The algorithm can be integrated into existing optimizers.

03

Efficiency optimizations improve practical performance.

Abstract

Complex queries for massive data analysis jobs have become increasingly commonplace. Many such queries contain com- mon subexpressions, either within a single query or among multiple queries submitted as a batch. Conventional query optimizers do not exploit these subexpressions and produce sub-optimal plans. The problem of multi-query optimization (MQO) is to generate an optimal combined evaluation plan by computing common subexpressions once and reusing them. Exhaustive algorithms for MQO explore an O(n^n) search space. Thus, this problem has primarily been tackled using various heuristic algorithms, without providing any theoretical guarantees on the quality of their solution. In this paper, instead of the conventional cost minimization problem, we treat the problem as maximizing a linear transformation of the cost function. We propose a greedy algorithm for this transformed…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsComplexity and Algorithms in Graphs · Data Management and Algorithms · Optimization and Search Problems