Differentially Private Testing of Identity and Closeness of Discrete Distributions
Jayadev Acharya, Ziteng Sun, Huanyu Zhang

TL;DR
This paper establishes optimal sample complexity bounds for differentially private identity and closeness testing of discrete distributions, introducing new algorithms and a general framework for lower bounds.
Contribution
It provides the first optimal algorithms for private closeness testing and a novel framework for lower bounds in private statistical testing.
Findings
Optimal sample complexity algorithms for identity testing.
First results and optimal bounds for private closeness testing.
A general framework for lower bounds using coupling and priors.
Abstract
We study the fundamental problems of identity testing (goodness of fit), and closeness testing (two sample test) of distributions over elements, under differential privacy. While the problems have a long history in statistics, finite sample bounds for these problems have only been established recently. In this work, we derive upper and lower bounds on the sample complexity of both the problems under -differential privacy. We provide optimal sample complexity algorithms for identity testing problem for all parameter ranges, and the first results for closeness testing. Our closeness testing bounds are optimal in the sparse regime where the number of samples is at most . Our upper bounds are obtained by privatizing non-private estimators for these problems. The non-private estimators are chosen to have small sensitivity. We propose a general framework to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Cryptography and Data Security · Wireless Communication Security Techniques
