Machine Learning with Privacy for Protected Attributes
Saeed Mahloujifar, Chuan Guo, G. Edward Suh, Kamalika Chaudhuri

TL;DR
This paper introduces feature differential privacy (FDP), a flexible privacy framework that protects specific attributes in machine learning, improving utility while maintaining privacy, demonstrated through diffusion models with significant performance gains.
Contribution
The paper proposes a novel, simulation-based FDP framework that generalizes differential privacy for protected attributes, with theoretical properties and practical algorithms for improved utility.
Findings
FDP can significantly improve model utility when public features are available.
The modified DP-SGD algorithm satisfies FDP and benefits from amplification via sub-sampling.
Application to diffusion models shows a drastic reduction in FID scores, e.g., from 286.7 to 101.9 at ε=8.
Abstract
Differential privacy (DP) has become the standard for private data analysis. Certain machine learning applications only require privacy protection for specific protected attributes. Using naive variants of differential privacy in such use cases can result in unnecessary degradation of utility. In this work, we refine the definition of DP to create a more general and flexible framework that we call feature differential privacy (FDP). Our definition is simulation-based and allows for both addition/removal and replacement variants of privacy, and can handle arbitrary and adaptive separation of protected and non-protected features. We prove the properties of FDP, such as adaptive composition, and demonstrate its implications for limiting attribute inference attacks. We also propose a modification of the standard DP-SGD algorithm that satisfies FDP while leveraging desirable properties such…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Privacy-Preserving Technologies in Data
MethodsDiffusion
