On Estimating the First Frequency Moment of Data Streams
Sumit Ganguly, Purushottam Kar

TL;DR
This paper introduces a new algorithm for estimating the first frequency moment in data streams that optimizes both space and update time, improving upon previous methods.
Contribution
The paper presents a novel algorithm that achieves near-optimal space and update processing time for estimating the first frequency moment in data streams.
Findings
Achieves space complexity of O(ε^{-2}(log n + (log ε^{-1}) log(mM)))
Per-update processing time of O((log n) log(ε^{-1}))
Improves upon previous algorithms by balancing space and time efficiency.
Abstract
Estimating the first moment of a data stream defined as to within -relative error with high probability is a basic and influential problem in data stream processing. A tight space bound of is known from the work of [Kane-Nelson-Woodruff-SODA10]. However, all known algorithms for this problem require per-update stream processing time of , with the only exception being the algorithm of [Ganguly-Cormode-RANDOM07] that requires per-update processing time of albeit with sub-optimal space . In this paper, we present an algorithm for estimating that achieves near-optimality in both space and update processing time. The space requirement is and the per-update…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Database Systems and Queries · Algorithms and Data Compression · Data Stream Mining Techniques
