Optimizing XML querying using type-based document projection
V\'eronique Benzaken, Giuseppe Castagna, Dario Colazzo, Kim, Nguyen

TL;DR
This paper introduces a type-based XML document projection method that enhances query optimization by accurately pruning unnecessary data, supporting complex XPath features, and improving performance in memory and native XML databases.
Contribution
A novel type system for XPath that enables efficient, precise, and comprehensive document pruning, including backward axes and multiple queries, with formal soundness and completeness proofs.
Findings
Improved query execution performance in main memory engines.
Enhanced pruning precision with lower overhead.
Better performance in native XML databases.
Abstract
XML data projection (or pruning) is a natural optimization for main memory query engines: given a query Q over a document D, the subtrees of D that are not necessary to evaluate Q are pruned, thus producing a smaller document D'; the query Q is then executed on D', hence avoiding to allocate and process nodes that will never be reached by Q. In this article, we propose a new approach, based on types, that greatly improves current solutions. Besides providing comparable or greater precision and far lesser pruning overhead, our solution ---unlike current approaches--- takes into account backward axes, predicates, and can be applied to multiple queries rather than just to single ones. A side contribution is a new type system for XPath able to handle backward axes. The soundness of our approach is formally proved. Furthermore, we prove that the approach is also complete (i.e., yields the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
