On the Structure of Replicable Hypothesis Testers
Anders Aamand, Maryam Aliakbarpour, Justin Y. Chen, Shyam Narayanan, Sandeep Silwal

TL;DR
This paper develops a comprehensive framework for understanding and designing replicable hypothesis testing algorithms, establishing bounds, canonical properties, and practical algorithms that enhance trust and reliability in statistical testing.
Contribution
It introduces canonical properties for replicable testers, proves bounds for key testing problems, and provides polynomial-time algorithms for Gaussian mean testing.
Findings
Canonical properties enable transformation of any replicable tester without loss of performance.
New lower bounds established for uniformity, identity, and closeness testing.
Polynomial-time algorithms achieved for Gaussian mean testing.
Abstract
A hypothesis testing algorithm is replicable if, when run on two different samples from the same distribution, it produces the same output with high probability. This notion, defined by by Impagliazzo, Lei, Pitassi, and Sorell [STOC'22], can increase trust in testing procedures and is deeply related to algorithmic stability, generalization, and privacy. We build general tools to prove lower and upper bounds on the sample complexity of replicable testers, unifying and quantitatively improving upon existing results. We identify a set of canonical properties, and prove that any replicable testing algorithm can be modified to satisfy these properties without worsening accuracy or sample complexity. A canonical replicable algorithm computes a deterministic function of its input (i.e., a test statistic) and thresholds against a uniformly random value in . It is invariant to the order…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
