Frequency Estimation with One-Sided Error
Piotr Indyk, Shyam Narayanan, David P. Woodruff

TL;DR
This paper investigates whether a frequency estimation sketch can combine Count-Min's no-underestimation property with Count-Sketch's error guarantees, concluding such a combination is impossible in key computational models.
Contribution
The paper proves the impossibility of creating a frequency sketch with combined properties in linear sketching and streaming models.
Findings
No combined sketch with both properties exists in linear models.
Impossibility results hold in streaming algorithms.
Study of complementary over-estimation constraints.
Abstract
Frequency estimation is one of the most fundamental problems in streaming algorithms. Given a stream of elements from some universe , the goal is to compute, in a single pass, a short sketch of so that for any element , one can estimate the number of times occurs in based on the sketch alone. Two state of the art solutions to this problems are the Count-Min and Count-Sketch algorithms. The frequency estimator produced by Count-Min, using dimensions, guarantees that with high probability, and holds deterministically. Also, Count-Min works under the assumption that . On the other hand, Count-Sketch, using dimensions, guarantees that with…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
