Private High-Dimensional Hypothesis Testing
Shyam Narayanan

TL;DR
This paper introduces improved differentially private algorithms for high-dimensional Gaussian identity testing, achieving near-optimal sample complexity and extending to related distribution testing problems.
Contribution
It presents new private testing algorithms with optimal sample complexity for high-dimensional Gaussian distributions and related problems, surpassing previous work.
Findings
Efficient private algorithms match non-private sample complexity in many cases.
Sample complexity bounds are improved over prior work.
Private Gaussian identity testing can require fewer samples than discrete distribution testing.
Abstract
We provide improved differentially private algorithms for identity testing of high-dimensional distributions. Specifically, for -dimensional Gaussian distributions with known covariance , we can test whether the distribution comes from for some fixed or from some with total variation distance at least from with -differential privacy, using only \[\tilde{O}\left(\frac{d^{1/2}}{\alpha^2} + \frac{d^{1/3}}{\alpha^{4/3} \cdot \varepsilon^{2/3}} + \frac{1}{\alpha \cdot \varepsilon}\right)\] samples if the algorithm is allowed to be computationally inefficient, and only \[\tilde{O}\left(\frac{d^{1/2}}{\alpha^2} + \frac{d^{1/4}}{\alpha \cdot \varepsilon}\right)\] samples for a computationally efficient algorithm. We also provide a matching lower bound showing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Cryptography and Data Security · Adversarial Robustness in Machine Learning
