Streaming Quantiles Algorithms with Small Space and Update Time
Nikita Ivkin, Edo Liberty, Kevin Lang, Zohar Karnin, Vladimir, Braverman

TL;DR
This paper presents practical variants of streaming quantile algorithms that significantly improve accuracy, reduce update times, and optimize memory usage, making them more efficient for real-world data stream processing.
Contribution
It introduces practical modifications to the asymptotically optimal quantile sketch, reducing error bounds, update times, and memory footprint, with experimental validation.
Findings
Error bound halved for given sketch size
Worst-case update time reduced from O(1/ε) to O(log(1/ε))
Memory footprint decreased through specialized data structures
Abstract
Approximating quantiles and distributions over streaming data has been studied for roughly two decades now. Recently, Karnin, Lang, and Liberty proposed the first asymptotically optimal algorithm for doing so. This manuscript complements their theoretical result by providing a practical variants of their algorithm with improved constants. For a given sketch size, our techniques provably reduce the upper bound on the sketch error by a factor of two. These improvements are verified experimentally. Our modified quantile sketch improves the latency as well by reducing the worst case update time from down to . We also suggest two algorithms for weighted item streams which offer improved asymptotic update times compared to na\"ive extensions. Finally, we provide a specialized data structure for these sketches which reduces both their memory…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Machine Learning and Algorithms · Machine Learning and Data Classification
