TL;DR
This paper reveals that the common add/remove adjacency model in differential privacy overstates attribute privacy, demonstrating the importance of choosing the correct adjacency relation for accurate privacy guarantees.
Contribution
The authors identify limitations of add/remove adjacency in DP, develop novel attacks under substitute adjacency, and empirically show discrepancies in privacy guarantees.
Findings
Add/remove adjacency overstates attribute privacy compared to substitute adjacency.
Novel attacks demonstrate the gap between reported DP guarantees and actual privacy.
Empirical results show inconsistencies with add/remove-based privacy accounting.
Abstract
Training machine learning models with differential privacy (DP) limits an adversary's ability to infer sensitive information about the training data. It can be interpreted as a bound on adversary's capability to distinguish two adjacent datasets according to chosen adjacency relation. In practice, most DP implementations use the add/remove adjacency relation, where two datasets are adjacent if one can be obtained from the other by adding or removing a single record, thereby protecting membership. In many ML applications, however, the goal is to protect attributes of individual records (e.g., labels used in supervised fine-tuning). We show that privacy accounting under add/remove overstates attribute privacy compared to accounting under the substitute adjacency relation, which permits substituting one record. To demonstrate this gap, we develop novel attacks to audit DP under substitute…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
