High Probability Frequency Moment Sketches
Sumit Ganguly, David P. Woodruff

TL;DR
This paper establishes tight bounds for high-confidence frequency moment sketches for p>2, showing that naive repetition methods are suboptimal and providing optimal algorithms and bounds.
Contribution
It introduces the first tight bounds for high-confidence frequency moment sketches for p>2, demonstrating the suboptimality of simple repetition methods and providing optimal algorithms.
Findings
Optimal upper and lower bounds on sketching dimension for p>2.
Repetition with median is suboptimal for high-confidence frequency moments.
Matching lower bounds up to constant factors.
Abstract
We consider the problem of sketching the -th frequency moment of a vector, , with multiplicative error at most and \emph{with high confidence} . Despite the long sequence of work on this problem, tight bounds on this quantity are only known for constant . While one can obtain an upper bound with error probability by repeating a sketching algorithm with constant error probability times in parallel, and taking the median of the outputs, we show this is a suboptimal algorithm! Namely, we show optimal upper and lower bounds of on the sketching dimension, for any constant approximation. Our result should be contrasted with results for estimating frequency moments for , for which we show the optimal algorithm for general …
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
