On The Memory Complexity of Uniformity Testing
Tomer Berg, Or Ordentlich, Ofer Shayevitz

TL;DR
This paper investigates the memory requirements for uniformity testing of distributions using limited memory machines, establishing bounds that show the minimal number of states needed based on distribution size and distance from uniform.
Contribution
It provides the first tight bounds on the number of states required for uniformity testing with limited memory, moving beyond collision counting and Paninski prior methods.
Findings
Upper bound on states: O(n log n / ε)
Lower bound on states: Ω(n + 1/ε)
Different techniques are needed from prior collision-based methods
Abstract
In this paper we consider the problem of uniformity testing with limited memory. We observe a sequence of independent identically distributed random variables drawn from a distribution over , which is either uniform or is -far from uniform under the total variation distance, and our goal is to determine the correct hypothesis. At each time point we are allowed to update the state of a finite-memory machine with states, where each state of the machine is assigned one of the hypotheses, and we are interested in obtaining an asymptotic probability of error at most uniformly under both hypotheses. The main contribution of this paper is deriving upper and lower bounds on the number of states needed in order to achieve a constant error probability , as a function of and , where our upper bound is $O(\frac{n\log…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Algorithms · Optimization and Search Problems · Cryptography and Data Security
