Estimating small frequency moments of data stream: a characteristic function approach
Sumit Ganguly, Purushottam Kar

TL;DR
This paper introduces a simplified analysis of the log-cosine estimator for estimating small frequency moments in data streams, improving understanding of a space-efficient method for this problem.
Contribution
It provides an elementary, stand-alone analysis of the log-cosine estimator, which was previously analyzed through more complex methods.
Findings
Simplified analysis of the log-cosine estimator.
The estimator is space-optimal for small frequency moments.
Enhanced understanding of frequency moment estimation techniques.
Abstract
A data stream is viewed as a sequence of updates of the form to an -dimensional integer frequency vector , where the update changes to , and is an integer and assumed to be in . The th frequency moment is defined as . We consider the problem of estimating to within a multiplicative approximation factor of , for . Several estimators have been proposed for this problem, including Indyk's median estimator \cite{indy:focs00}, Li's geometric means estimator \cite{pinglib:2006}, an \Hss-based estimator \cite{gc:random07}. The first two estimators require space , where the notation hides polylogarithmic factors in and . Recently, Kane, Nelson and Woodruff in \cite{knw:soda10} present a space-optimal…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Algorithms · Algorithms and Data Compression · Data Management and Algorithms
