Revisiting Differentially Private Hypothesis Tests for Categorical Data
Yue Wang, Jaewoo Lee, Daniel Kifer

TL;DR
This paper develops new differentially private hypothesis tests for categorical data that improve power and reliability by adjusting for privacy-induced noise, applicable to various statistical tests.
Contribution
It introduces practical, bias-corrected differentially private tests for categorical data, using a new asymptotic regime and modified test equivalences.
Findings
Enhanced test power under differential privacy
Reliable p-values with bias correction
Effective on diverse datasets and privacy levels
Abstract
In this paper, we consider methods for performing hypothesis tests on data protected by a statistical disclosure control technology known as differential privacy. Previous approaches to differentially private hypothesis testing either perturbed the test statistic with random noise having large variance (and resulted in a significant loss of power) or added smaller amounts of noise directly to the data but failed to adjust the test in response to the added noise (resulting in biased, unreliable -values). In this paper, we develop a variety of practical hypothesis tests that address these problems. Using a different asymptotic regime that is more suited to hypothesis testing with privacy, we show a modified equivalence between chi-squared tests and likelihood ratio tests. We then develop differentially private likelihood ratio and chi-squared tests for a variety of applications on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Cryptography and Data Security · Mobile Crowdsensing and Crowdsourcing
