HyperLogLogLog: Cardinality Estimation With One Log More
Matti Karppa, Rasmus Pagh

TL;DR
HyperLogLogLog introduces a highly compressed, mergeable sketch for cardinality estimation that reduces space usage by about 40% while maintaining estimation accuracy and efficiency.
Contribution
It presents a novel compression method for HyperLogLog sketches, significantly reducing space while preserving all estimation and merge properties.
Findings
Achieved approximately 40% reduction in sketch size.
Maintains competitive update and merge times.
Provides a practical implementation with experimental validation.
Abstract
We present HyperLogLogLog, a practical compression of the HyperLogLog sketch that compresses the sketch from bits down to bits for estimating the number of distinct elements~ using ~registers. The algorithm works as a drop-in replacement that preserves all estimation properties of the HyperLogLog sketch, it is possible to convert back and forth between the compressed and uncompressed representations, and the compressed sketch maintains mergeability in the compressed domain. The compressed sketch can be updated in amortized constant time, assuming is sufficiently larger than . We provide a C++ implementation of the sketch, and show by experimental evaluation against well-known implementations by Google and Apache that our implementation provides small sketches while maintaining competitive update and merge times.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAlgorithms and Data Compression · Video Analysis and Summarization · Parallel Computing and Optimization Techniques
