Formal Privacy for Partially Private Data
Jeremy Seeman, Matthew Reimherr, Aleksandra Slavkovic

TL;DR
This paper introduces a new privacy formalism called $(\epsilon, \\{\\Theta_z\\}_{z \\in \\mathcal{Z}})$-Pufferfish, which extends differential privacy to collections of data releases with mixed privacy guarantees, and provides mechanisms and algorithms for partially private data analysis.
Contribution
The paper proposes the $(\\epsilon, \\{\\Theta_z\\}_{z \\in \\mathcal{Z}})$-Pufferfish formalism, mechanisms for partial privacy, and algorithms for posterior sampling, advancing privacy analysis for mixed data releases.
Findings
Formalizes a new privacy framework extending DP.
Provides mechanisms satisfying the new privacy guarantees.
Demonstrates improved inference methods on COVID-19 data.
Abstract
Differential privacy (DP) quantifies privacy loss by analyzing noise injected into output statistics. For non-trivial statistics, this noise is necessary to ensure finite privacy loss. However, data curators frequently release collections of statistics where some use DP mechanisms and others are released as-is, i.e., without additional randomized noise. Consequently, DP alone cannot characterize the privacy loss attributable to the entire collection of releases. In this paper, we present a privacy formalism, -Pufferfish (-TP for short when is implied), a collection of Pufferfish mechanisms indexed by realizations of a random variable representing public information not protected with DP noise. First, we prove that this definition has similar properties to DP. Next, we introduce mechanisms…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Advanced Causal Inference Techniques · Cryptography and Data Security
