Independent Range Sampling on Interval Data (Longer Version)
Daichi Amagata

TL;DR
This paper introduces an efficient method for independent range sampling on interval data using an augmented interval tree, enabling fast, scalable, and flexible sampling for large datasets with practical applications.
Contribution
It proposes a novel augmented interval tree structure and algorithms for independent range sampling, improving efficiency and flexibility over existing methods.
Findings
Algorithms need $O(n \, \log n)$ space and $O(\log^{2} n + s)$ time.
Extensions allow $O(\log^{2} n + s)$ expected and $O(n)$ space complexities.
Experiments show significant performance improvements over competitors.
Abstract
Many applications require efficient management of large sets of intervals because many objects are associated with intervals (e.g., time and price intervals). In such interval management systems, range search is a primitive operator for retrieving and analysis tasks. As dataset sizes are growing nowadays, range search results are also becoming larger, which may overwhelm users and incur long computation time. Because applications are usually satisfied with a subset of the result set, it is desirable to efficiently obtain only small samples from the result set.We therefore address the problem of independent range sampling on interval data, which outputs random samples that overlap a given query interval and are independent of the samples of all previous queries. To efficiently solve this problem theoretically and practically, we propose a variant of an interval tree, namely the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFault Detection and Control Systems · Target Tracking and Data Fusion in Sensor Networks · Advanced Statistical Methods and Models
