The power of private likelihood-ratio tests for goodness-of-fit in frequency tables
Emanuele Dolera, Stefano Favaro

TL;DR
This paper analyzes the impact of differential privacy on the power of likelihood-ratio tests for goodness-of-fit in frequency tables, quantifying the trade-off between privacy and statistical utility through large deviation theory.
Contribution
It provides a rigorous large deviation analysis of private likelihood-ratio tests under differential privacy, revealing the sample size cost for maintaining test power.
Findings
Quantifies the power loss due to privacy perturbations.
Derives a large deviation expansion for private LR test power.
Characterizes the sample size increase needed for privacy-preserving tests.
Abstract
Privacy-protecting data analysis investigates statistical methods under privacy constraints. This is a rising challenge in modern statistics, as the achievement of confidentiality guarantees, which typically occurs through suitable perturbations of the data, may determine a loss in the statistical utility of the data. In this paper, we consider privacy-protecting tests for goodness-of-fit in frequency tables, this being arguably the most common form of releasing data, and present a rigorous analysis of the large sample behaviour of a private likelihood-ratio (LR) test. Under the framework of -differential privacy for perturbed data, our main contribution is the power analysis of the private LR test, which characterizes the trade-off between confidentiality, measured via the differential privacy parameters , and statistical utility, measured…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsTest
