The Optimal Quantile Estimator for Compressed Counting
Ping Li

TL;DR
This paper introduces an optimal quantile estimator for Compressed Counting, improving efficiency and accuracy over previous estimators, especially for data streams where alpha exceeds 1, with applications in entropy computation.
Contribution
It proposes a new optimal quantile estimator for Compressed Counting that is more efficient and accurate than existing estimators for alpha greater than 1.
Findings
The optimal quantile estimator outperforms previous estimators in accuracy.
It reduces computational complexity compared to earlier methods.
Effective for computing entropy in data streams.
Abstract
Compressed Counting (CC) was recently proposed for very efficiently computing the (approximate) th frequency moments of data streams, where . Several estimators were reported including the geometric mean estimator, the harmonic mean estimator, the optimal power estimator, etc. The geometric mean estimator is particularly interesting for theoretical purposes. For example, when , the complexity of CC (using the geometric mean estimator) is , breaking the well-known large-deviation bound . The case has important applications, for example, computing entropy of data streams. For practical purposes, this study proposes the optimal quantile estimator. Compared with previous estimators, this estimator is computationally more efficient and is also more accurate when .
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Methods and Mixture Models · Machine Learning and Algorithms · Data Stream Mining Techniques
