Revisiting Frequency Moment Estimation in Random Order Streams
Vladimir Braverman, Emanuele Viola, David Woodruff, Lin F. Yang

TL;DR
This paper demonstrates that in the random-order data stream model, frequency moment estimation can be significantly more space-efficient than in the turnstile model, achieving near-optimal deterministic algorithms with substantially reduced space complexity.
Contribution
The authors show that in the random-order model, frequency moment estimation can be done with much less space, and they provide a deterministic algorithm with near-optimal bounds.
Findings
Achieves $ ilde{O}( ext{epsilon}^{-2} + ext{log} n)$ space for frequency moments in random-order streams.
Provides a deterministic algorithm with near-optimal space bounds in the random-order model.
Shows quadratic improvement over turnstile model when $ ext{epsilon}^{-2} oughly ext{log} n$.
Abstract
We revisit one of the classic problems in the data stream literature, namely, that of estimating the frequency moments for of an underlying -dimensional vector presented as a sequence of additive updates in a stream. It is well-known that using -stable distributions one can approximate any of these moments up to a multiplicative -factor using bits of space, and this space bound is optimal up to a constant factor in the turnstile streaming model. We show that surprisingly, if one instead considers the popular random-order model of insertion-only streams, in which the updates to the underlying vector arrive in a random order, then one can beat this space bound and achieve bits of space, where the hides poly factors. If $\epsilon^{-2} \approx…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
