Approaching 100% Confidence in Stream Summary through ReliableSketch
Yuhan Wu, Hanbo Wu, Xilai Liu, Yikai Zhao, Tong Yang, Kaicheng Yang,, Sha Wang, Lihua Miao, Gaogang Xie

TL;DR
ReliableSketch is a new data stream sketch that guarantees high-confidence, low-error approximations for all keys simultaneously, with efficient time and space complexity, and is practical for hardware implementation.
Contribution
It introduces ReliableSketch, the first sketch controlling error for all keys with low failure probability, and demonstrates its efficiency and hardware-friendliness through implementation and experiments.
Findings
Guarantees error below Λ for all keys with high probability
Achieves near-optimal throughput in experiments
Requires only small space and time overhead
Abstract
To approximate sums of values in key-value data streams, sketches are widely used in databases and networking systems. They offer high-confidence approximations for any given key while ensuring low time and space overhead. While existing sketches are proficient in estimating individual keys, they struggle to maintain this high confidence across all keys collectively, an objective that is critically important in both algorithm theory and its practical applications. We propose ReliableSketch, the first to control the error of all keys to less than with a small failure probability , requiring only amortized time and space. Furthermore, its simplicity makes it hardware-friendly, and we implement it on CPU servers, FPGAs, and programmable switches. Our experiments show that under the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Stream Mining Techniques · Advanced Database Systems and Queries · Time Series Analysis and Forecasting
