Cuckoo Heavy Keeper and the balancing act of maintaining heavy hitters in stream processing
Vinh Quang Ngo, Marina Papatriantafilou

TL;DR
The paper introduces Cuckoo Heavy Keeper, a novel streaming algorithm for heavy hitter detection that balances scalability, accuracy, and efficiency, outperforming existing methods especially in parallel and low-memory scenarios.
Contribution
It proposes a new inverted process approach for heavy hitter detection, enabling scalable parallelization and improved accuracy over state-of-the-art algorithms.
Findings
CHK improves throughput by 1.7-5.7× over existing methods.
CHK achieves up to four orders of magnitude better accuracy.
Parallel instances of CHK scale nearly linearly with low latency.
Abstract
Finding heavy hitters in databases and data streams is a fundamental problem with applications ranging from network monitoring to database query optimization, machine learning, and more. Approximation algorithms offer practical solutions, but they present trade-offs involving throughput, memory usage, and accuracy. Moreover, modern applications further complicate these trade-offs by demanding capabilities beyond sequential processing that require both parallel scaling and support for concurrent queries and updates. Analysis of these trade-offs led us to the key idea behind our proposed streaming algorithm, Cuckoo Heavy Keeper (CHK). The approach introduces an inverted process for distinguishing frequent from infrequent items, which unlocks new algorithmic synergies that were previously inaccessible with conventional approaches. By further analyzing the competing metrics with a focus…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsIndustrial Engineering and Technologies · Engineering Diagnostics and Reliability · Engineering Technology and Methodologies
