SpaceSaving$^\pm$: An Optimal Algorithm for Frequency Estimation and Frequent items in the Bounded Deletion Model
Fuheng Zhao, Divyakant Agrawal, Amr El Abbadi, Ahmed Metwally

TL;DR
This paper introduces SpaceSaving$^pm$, an optimal deterministic algorithm for frequency estimation and frequent items detection in the bounded deletion model, achieving high accuracy with minimal space and latency.
Contribution
It presents the first deterministic algorithms with optimal space bounds for these problems in the bounded deletion model and introduces Dyadic SpaceSaving$^pm$, a deterministic quantile sketch.
Findings
SpaceSaving$^pm$ achieves high accuracy and recall in experiments.
The algorithms operate with minimal space and low latency.
Dyadic SpaceSaving$^pm$ is the first deterministic quantile sketch in this model.
Abstract
In this paper, we propose the first deterministic algorithms to solve the frequency estimation and frequent item problems in the bounded deletion model. We establish the space lower bound for solving the deterministic frequent items problem in the bounded deletion model, and propose the Lazy SpaceSaving and SpaceSaving algorithms with optimal space bound. We then develop an efficient implementation of the SpaceSaving algorithm that minimizes the latency of update operations using novel data structures. The experimental evaluations testify that SpaceSaving has accurate frequency estimations and achieves very high recall and precision across different data distributions while using minimal space. Our analysis and experiments clearly demonstrate that SpaceSaving provides more accurate estimations using the same space as the state of the art protocols for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Quality and Management · Data Mining Algorithms and Applications · Algorithms and Data Compression
