Estimating small moments of data stream in nearly optimal space-time
Sumit Ganguly

TL;DR
This paper introduces a nearly optimal randomized algorithm for estimating small moments of data streams with improved space and time efficiency, advancing the state of streaming algorithms.
Contribution
It presents the first nearly optimal algorithm for small moment estimation in data streams, with a novel technique to separate heavy hitters and light elements.
Findings
Achieves space complexity close to the lower bound.
Processes each update efficiently with low time complexity.
Provides unbiased estimators with low variance for frequency moments.
Abstract
For each , we present a randomized algorithm that returns an -approximation of the th frequency moment of a data stream . The algorithm requires space and processes each stream update using time . It is nearly optimal in terms of space (lower bound as well as time and is the first algorithm with these properties. The technique separates heavy hitters from the remaining items in the stream using an appropriate threshold and estimates the contribution of the heavy hitters and the light elements to separately. A key component is the design of an unbiased estimator for whose data structure has low update time and low variance.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Stream Mining Techniques · Distributed Sensor Networks and Detection Algorithms · Advanced Statistical Process Monitoring
