Optimal Quantile Approximation in Streams
Zohar Karnin, Kevin Lang, Edo Liberty

TL;DR
This paper presents an optimal space-efficient algorithm for quantile approximation in data streams, resolving a long-standing open problem by matching upper and lower bounds.
Contribution
It introduces a novel, simplified merge-and-reduce technique that achieves optimal space complexity for quantile sketches, closing the gap between randomized and deterministic methods.
Findings
Achieves an $O((1/\varepsilon)\log \log (1/\delta))$ space complexity
Provides a matching lower bound, proving optimality
Demonstrates a simple, tight analysis of the new sketching technique
Abstract
This paper resolves one of the longest standing basic problems in the streaming computational model. Namely, optimal construction of quantile sketches. An approximate quantile sketch receives a stream of items and allows one to approximate the rank of any query up to additive error with probability at least . The rank of a query is the number of stream items such that . The minimal sketch size required for this task is trivially at least . Felber and Ostrovsky obtain a space sketch for a fixed . To date, no better upper or lower bounds were known even for randomly permuted streams or for approximating a specific quantile, e.g.,\ the median. This paper obtains an space sketch and a matching lower bound. This…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComplexity and Algorithms in Graphs · Stochastic Gradient Optimization Techniques · Machine Learning and Algorithms
