Max-Information, Differential Privacy, and Post-Selection Hypothesis Testing
Ryan Rogers, Aaron Roth, Adam Smith, Om Thakkar

TL;DR
This paper explores how approximate differential privacy guarantees can be used to perform valid adaptive hypothesis testing by controlling max-information, extending the understanding of privacy's role in generalization and statistical validity.
Contribution
It establishes a connection between $(\epsilon,\delta)$-differential privacy and bounded max-information for product distributions, and analyzes composition limitations in this context.
Findings
$(\epsilon,\delta)$-DP algorithms have bounded max-information on product distributions.
Differential privacy can be used to correct $p$-values in adaptive hypothesis testing.
Limitations of composition show the connection only holds for inputs from product distributions.
Abstract
In this paper, we initiate a principled study of how the generalization properties of approximate differential privacy can be used to perform adaptive hypothesis testing, while giving statistically valid -value corrections. We do this by observing that the guarantees of algorithms with bounded approximate max-information are sufficient to correct the -values of adaptively chosen hypotheses, and then by proving that algorithms that satisfy -differential privacy have bounded approximate max information when their inputs are drawn from a product distribution. This substantially extends the known connection between differential privacy and max-information, which previously was only known to hold for (pure) -differential privacy. It also extends our understanding of max-information as a partially unifying measure controlling the generalization…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
