Demythization of Structural XML Query Processing: Comparison of Holistic and Binary Approaches, Technical Report
Petr Luk\'a\v{s}, Radim Ba\v{c}a, Michal Kr\'atk\'y, Tok Wang, Ling

TL;DR
This paper compares holistic and binary approaches for XML query processing, demonstrating that optimized binary joins often outperform holistic methods, especially for complex queries with many non-output nodes, and introduces improvements and a combined approach.
Contribution
It introduces enhancements to binary join methods for XML queries, proving their efficiency and linear complexity, and proposes a hybrid approach combining both methods.
Findings
Binary joins can outperform holistic joins in many cases.
Optimized binary plans have linear time and space complexity.
A combined approach leverages advantages of both methods.
Abstract
XML query can be modeled by twig pattern query (TPQ) specifying predicates on XML nodes and XPath relationships satisfied between them. A lot of TPQ types have been proposed; this paper takes into account a TPQ model extended by a specification of output and non-output query nodes since it complies with the XQuery semantics and, in many cases, it leads to a more efficient query processing. In general, there are two approaches to process the TPQ: holistic joins and binary joins. Whereas the binary join approach builds a query plan as a tree of interconnected binary operators, the holistic join approach evaluates a whole query using one operator (i.e., using one complex algorithm). Surprisingly, a thorough analytical and experimental comparison is still missing despite an enormous research effort in this area. In this paper, we try to fill this gap; we analytically and experimentally show…
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5
Figure 6
Figure 7
Figure 8Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Database Systems and Queries · Data Management and Algorithms · Semantic Web and Ontologies
