Optimal Planar Range Skyline Reporting with Linear Space in External Memory
Yufei Tao, Jeonghun Yoon

TL;DR
This paper introduces optimal linear-space data structures for answering range skyline queries in external memory, achieving near-optimal I/O performance for various query types, and establishes complexity bounds for more challenging query variants.
Contribution
It presents the first linear-size external memory data structures with optimal I/O bounds for top-open and 3-sided skyline queries, and proves complexity lower bounds for left- and bottom-open queries.
Findings
Top-open queries can be answered in O(log_B(n) + k/B) I/Os.
Data points on a U x U grid enable faster query times, down to O(1 + k/B).
Left- and bottom-open queries are proven to be inherently harder, matching the complexity of general 4-sided queries.
Abstract
Let P be a set of n points in R^2. Given a rectangle Q = [\alpha_1, \alpha_2] x [\beta_1, \beta_2], a range skyline query returns the maxima of the points in P \cap Q. An important variant is the so-called top-open queries, where Q is a 3-sided rectangle whose upper edge is grounded at y = \infty (that is, \beta_2 = \infty). These queries are crucial in numerous database applications. In internal memory, extensive research has been devoted to designing data structures that can answer such queries efficiently. In contrast, currently there is no clear understanding about their exact complexities in external memory. This paper presents several structures of linear size for answering the above queries with the optimal I/O cost. We show that a top-open query can be solved in O(log_B(n) + k/B) I/Os, where B is the block size and k is the number of points in the query result. The query cost…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Management and Algorithms · Computational Geometry and Mesh Generation · Constraint Satisfaction and Optimization
