Deterministic Heavy Hitters with Sublinear Query Time
Yi Li, Vasileios Nakos

TL;DR
This paper introduces a deterministic linear sketch for heavy hitters in streaming data, achieving sublinear query time with near state-of-the-art space complexity, and applies iterative techniques from sparse recovery.
Contribution
It presents the first deterministic linear sketch with sublinear query time for heavy hitters, using an iterative approach inspired by sparse recovery techniques.
Findings
Achieves $O(rac{1}{^2} \, ext{log} n \, ext{log}^*(rac{1}{}))$ row complexity.
Answers queries in sublinear time, significantly improving over previous linear-time algorithms.
Provides sublinear algorithms for related problems like group testing and compressed sensing.
Abstract
This paper studies the classic problem of finding heavy hitters in the turnstile streaming model. We give the first deterministic linear sketch that has rows and answers queries in sublinear time. The number of rows is only a factor of more than that used by the state-of-the-art algorithm prior to our paper due to Nelson, Nguyen and Woodruff (RANDOM'12). Their algorithm runs in time at least linear in the universe size , which is highly undesirable in streaming applications. Our approach is based on an iterative procedure, where most unrecovered heavy hitters are identified in each iteration. Although this technique has been extensively employed in the related problem of sparse recovery, this is the first time, to the best of our knowledge, that it has been used in the context of heavy hitters.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
