MagnifierSketch: Quantile Estimation Centered at One Point
Jiarui Guo, Qiushi Lyu, Yuhan Wu, Haoyu Li, Zhaoqian Yao, Yuqi Dong, Xiaolin Wang, Bin Cui, Tong Yang

TL;DR
MagnifierSketch is a novel algorithm designed for accurate, high-throughput quantile estimation centered at a specific point in data streams, improving accuracy and efficiency over existing methods.
Contribution
It introduces MagnifierSketch, a new method for point-quantile estimation supporting both single-key and per-key scenarios with proven unbiasedness and optimized complexity.
Findings
Significantly lower average error than state-of-the-art methods.
Supports high-speed data streams with improved throughput.
Reduces quantile query latency in real database systems.
Abstract
In this paper, we take into consideration quantile estimation in data stream models, where every item in the data stream is a key-value pair. Researchers sometimes aim to estimate per-key quantiles (i.e. quantile estimation for every distinct key), and some popular use cases, such as tail latency measurement, recline on a predefined single quantile (e.g. 0.95- or 0.99- quantile) rather than demanding arbitrary quantile estimation. However, existing algorithms are not specially designed for per-key estimation centered at one point. They cannot achieve high accuracy in our problem setting, and their throughput are not satisfactory to handle high-speed items in data streams. To solve this problem, we propose MagnifierSketch for point-quantile estimation. MagnifierSketch supports both single-key and per-key quantile estimation, and its key techniques are named Value Focus, Distribution…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Database Systems and Queries · Data Quality and Management · Data Management and Algorithms
