Approximate counting with a floating-point counter
Miklos Csuros

TL;DR
This paper introduces a floating-point probabilistic counter that efficiently estimates large counts using minimal memory, improving upon Morris's original approximate counting method with a new unbiased estimator and detailed performance analysis.
Contribution
It presents a novel floating-point counter design with a simple unbiased estimator, extending Morris's approximate counting technique with enhanced accuracy and practical formulas for performance assessment.
Findings
Uses d + log log n bits for counting
Achieves an unbiased estimate with standard deviation ~0.6 * n * 2^{-d/2}
Provides a general performance analysis framework
Abstract
Memory becomes a limiting factor in contemporary applications, such as analyses of the Webgraph and molecular sequences, when many objects need to be counted simultaneously. Robert Morris [Communications of the ACM, 21:840--842, 1978] proposed a probabilistic technique for approximate counting that is extremely space-efficient. The basic idea is to increment a counter containing the value with probability . As a result, the counter contains an approximation of after probabilistic updates stored in bits. Here we revisit the original idea of Morris, and introduce a binary floating-point counter that uses a -bit significand in conjunction with a binary exponent. The counter yields a simple formula for an unbiased estimation of with a standard deviation of about , and uses bits. We analyze the floating-point…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAlgorithms and Data Compression · Computability, Logic, AI Algorithms · Machine Learning and Algorithms
