PANDAExpress: a Simpler and Faster PANDA Algorithm
Mahmoud Abo Khamis, Hung Q. Ngo, Dan Suciu

TL;DR
PANDAExpress is a new algorithm that simplifies and accelerates the PANDA framework for conjunctive queries and disjunctive datalog rules, removing the polylogarithmic factor and matching specialized algorithms' runtimes.
Contribution
It introduces a novel probabilistic inequality and a dynamic hyperplane partitioning scheme to significantly improve PANDA's efficiency while maintaining its generality.
Findings
PANDAExpress removes the polylogarithmic runtime factor from PANDA.
It matches the optimal runtimes of specialized algorithms for certain problems.
The new partitioning scheme adapts to data skewness for better performance.
Abstract
PANDA is a powerful generic algorithm for answering conjunctive queries (CQs) and disjunctive datalog rules (DDRs) given input degree constraints. In the special case where degree constraints are cardinality constraints and the query is Boolean, PANDA runs in $\tilde{O}(N^{\mathrm{subw}})$-time, where $N$ is the input size, and $\mathrm{subw}$ is the submodular width of the query, a notion introduced by Daniel Marx (JACM 2013). When specialized to certain classes of sub-graph pattern finding problems, the runtime matches the optimal runtime possible, modulo some conjectures in fine-grained complexity (Bringmann and Gorbachev, STOC 25). The PANDA framework is much more general, as it handles arbitrary input degree constraints, which capture common statistics and integrity constraints used in relational database management systems, it works for queries with free variables, and for…
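To make the notion of a degree constraint concrete, here is a minimal Python sketch. It is not taken from the paper: the helper `max_degree` and the toy `edges` relation are hypothetical names introduced only for illustration. It computes the degree of attributes Y given attributes X in a relation, that is, the largest number of distinct Y-values that share a single X-value. A cardinality constraint is the special case where X is empty, and a functional dependency from X to Y corresponds to a degree of at most 1.

```python
from collections import defaultdict


def max_degree(relation, columns, x_attrs, y_attrs):
    """Return deg(Y | X): the largest number of distinct Y-values
    that co-occur with any single X-value in the relation.

    Hypothetical helper for illustration only; not part of PANDA.
    """
    x_idx = [columns.index(a) for a in x_attrs]
    y_idx = [columns.index(a) for a in y_attrs]
    groups = defaultdict(set)
    for row in relation:
        x_key = tuple(row[i] for i in x_idx)  # () when X is empty
        groups[x_key].add(tuple(row[i] for i in y_idx))
    return max((len(ys) for ys in groups.values()), default=0)


# Toy edge relation E(a, b), an assumed example input.
edges = [(1, 2), (1, 3), (1, 4), (2, 3)]
cols = ["a", "b"]

# Cardinality constraint: X is empty, so the degree is the relation size.
cardinality = max_degree(edges, cols, [], ["a", "b"])

# Degree constraints: maximum out-degree and maximum in-degree.
out_degree = max_degree(edges, cols, ["a"], ["b"])
in_degree = max_degree(edges, cols, ["b"], ["a"])

print(cardinality, out_degree, in_degree)
```

Statistics of this kind (relation sizes, maximum degrees, keys) are what a database system would supply as input constraints; the sketch only measures them and does not implement any part of the query algorithm.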
