Output-Optimal Algorithms for Join-Aggregate Queries
Xiao Hu

TL;DR
This paper establishes the first output-optimal algorithms for acyclic join-aggregate queries, providing tight bounds that improve upon the classic Yannakakis algorithm and resolving a longstanding open problem.
Contribution
It introduces the use of free-connex fractional hypertree width to characterize output-optimal complexity for acyclic join-aggregate queries, achieving polynomial improvements.
Findings
Matching lower and upper bounds for query computation complexity
First polynomial improvement over Yannakakis algorithm in 40 years
Complete resolution of the open problem for output-optimal algorithms
Abstract
One of the most celebrated results of computing join-aggregate queries defined over commutative semi-rings is the classic Yannakakis algorithm proposed in 1981. It is known that the runtime of the Yannakakis algorithm is for any free-connex query, where is the input size of the database and is the output size of the query result. This is already output-optimal. However, only an upper bound on the runtime is known for the large remaining class of acyclic but non-free-connex queries. Alternatively, one can convert a non-free-connex query into a free-connex one using tree decomposition techniques and then run the Yannakakis algorithm. This approach takes time, where is the {\em free-connex sub-modular width} of the input query. But, none of these results is known to be output-optimal. In this…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Management and Algorithms · Advanced Database Systems and Queries · Energy Efficient Wireless Sensor Networks
