Range Queries on Uncertain Data
Jian Li, Haitao Wang

TL;DR
This paper develops efficient data structures for range queries on uncertain data points on the real line, enabling quick retrieval of top probabilities, top-k points, or points exceeding a probability threshold within a query interval.
Contribution
It introduces novel data structures with linear or near-linear space complexity for answering various range queries on uncertain data points.
Findings
Data structures support efficient range queries on uncertain data
Linear or near-linear space complexity achieved
Query times are optimized for different query types
Abstract
Given a set of uncertain points on the real line, each represented by its one-dimensional probability density function, we consider the problem of building data structures on to answer range queries of the following three types for any query interval : (1) top- query: find the point in that lies in with the highest probability, (2) top- query: given any integer as part of the query, return the points in that lie in with the highest probabilities, and (3) threshold query: given any threshold as part of the query, return all points of that lie in with probabilities at least . We present data structures for these range queries with linear or nearly linear space and efficient query time.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Management and Algorithms · Algorithms and Data Compression · Machine Learning and Algorithms
