Separations for Estimating Large Frequency Moments on Data Streams
David P. Woodruff, Samson Zhou

TL;DR
This paper develops space-efficient algorithms for estimating large frequency moments in data streams, with improved bounds in random-order models and new lower bounds, highlighting differences between stream models.
Contribution
It introduces new algorithms with better space complexity for $F_p$ estimation in random-order streams and establishes lower bounds, advancing understanding of stream model complexities.
Findings
Improved space complexity algorithms for $F_p$ estimation in random-order streams.
First optimal multi-pass $F_p$ estimation algorithms up to logarithmic factors.
New lower bounds for one-pass insertion-only streams.
Abstract
We study the classical problem of moment estimation of an underlying vector whose coordinates are implicitly defined through a series of updates in a data stream. We show that if the updates to the vector arrive in the random-order insertion-only model, then there exist space efficient algorithms with improved dependencies on the approximation parameter . In particular, for any real , we first obtain an algorithm for moment estimation using bits of memory. Our techniques also give algorithms for moment estimation with on arbitrary order insertion-only and turnstile streams, using bits of space and two passes, which is the first optimal multi-pass estimation algorithm up to …
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
