Learning-Based Heavy Hitters and Flow Frequency Estimation in Streams
Rana Shahout, Michael Mitzenmacher

TL;DR
This paper introduces LSS, a novel machine learning-enhanced competing-counter algorithm based on Space Saving, which improves heavy hitter detection and flow frequency estimation in network streams.
Contribution
It presents the first learned competing-counter algorithm, LSS, combining machine learning with Space Saving for improved accuracy and efficiency in frequency estimation tasks.
Findings
LSS outperforms traditional Space Saving in accuracy.
LSS demonstrates improved efficiency on synthetic and real datasets.
Theoretical analysis supports the effectiveness of LSS.
Abstract
Identifying heavy hitters and estimating the frequencies of flows are fundamental tasks in various network domains. Existing approaches to this challenge can broadly be categorized into two groups, hashing-based and competing-counter-based. The Count-Min sketch is a standard example of a hashing-based algorithm, and the Space Saving algorithm is an example of a competing-counter algorithm. Recent works have explored the use of machine learning to enhance algorithms for frequency estimation problems, under the algorithms with prediction framework. However, these works have focused solely on the hashing-based approach, which may not be best for identifying heavy hitters. In this paper, we present the first learned competing-counter-based algorithm, called LSS, for identifying heavy hitters, top k, and flow frequency estimation that utilizes the well-known Space Saving algorithm. We…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsModel Reduction and Neural Networks · Lattice Boltzmann Simulation Studies · Anomaly Detection Techniques and Applications
