Max-Information, Differential Privacy, and Post-Selection Hypothesis   Testing

Ryan Rogers; Aaron Roth; Adam Smith; Om Thakkar

arXiv:1604.03924·cs.LG·September 12, 2016

Max-Information, Differential Privacy, and Post-Selection Hypothesis Testing

Ryan Rogers, Aaron Roth, Adam Smith, Om Thakkar

PDF

TL;DR

This paper explores how approximate differential privacy guarantees can be used to perform valid adaptive hypothesis testing by controlling max-information, extending the understanding of privacy's role in generalization and statistical validity.

Contribution

It establishes a connection between $(\epsilon,\delta)$-differential privacy and bounded max-information for product distributions, and analyzes composition limitations in this context.

Findings

01

$(\epsilon,\delta)$-DP algorithms have bounded max-information on product distributions.

02

Differential privacy can be used to correct $p$-values in adaptive hypothesis testing.

03

Limitations of composition show the connection only holds for inputs from product distributions.

Abstract

In this paper, we initiate a principled study of how the generalization properties of approximate differential privacy can be used to perform adaptive hypothesis testing, while giving statistically valid $p$ -value corrections. We do this by observing that the guarantees of algorithms with bounded approximate max-information are sufficient to correct the $p$ -values of adaptively chosen hypotheses, and then by proving that algorithms that satisfy $(ϵ, δ)$ -differential privacy have bounded approximate max information when their inputs are drawn from a product distribution. This substantially extends the known connection between differential privacy and max-information, which previously was only known to hold for (pure) $(ϵ, 0)$ -differential privacy. It also extends our understanding of max-information as a partially unifying measure controlling the generalization…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.