Machine Learning with Privacy for Protected Attributes

Saeed Mahloujifar; Chuan Guo; G. Edward Suh; Kamalika Chaudhuri

arXiv:2506.19836·cs.CR·June 25, 2025

Machine Learning with Privacy for Protected Attributes

Saeed Mahloujifar, Chuan Guo, G. Edward Suh, Kamalika Chaudhuri

PDF

Open Access

TL;DR

This paper introduces feature differential privacy (FDP), a flexible privacy framework that protects specific attributes in machine learning, improving utility while maintaining privacy, demonstrated through diffusion models with significant performance gains.

Contribution

The paper proposes a novel, simulation-based FDP framework that generalizes differential privacy for protected attributes, with theoretical properties and practical algorithms for improved utility.

Findings

01

FDP can significantly improve model utility when public features are available.

02

The modified DP-SGD algorithm satisfies FDP and benefits from amplification via sub-sampling.

03

Application to diffusion models shows a drastic reduction in FID scores, e.g., from 286.7 to 101.9 at ε=8.

Abstract

Differential privacy (DP) has become the standard for private data analysis. Certain machine learning applications only require privacy protection for specific protected attributes. Using naive variants of differential privacy in such use cases can result in unnecessary degradation of utility. In this work, we refine the definition of DP to create a more general and flexible framework that we call feature differential privacy (FDP). Our definition is simulation-based and allows for both addition/removal and replacement variants of privacy, and can handle arbitrary and adaptive separation of protected and non-protected features. We prove the properties of FDP, such as adaptive composition, and demonstrate its implications for limiting attribute inference attacks. We also propose a modification of the standard DP-SGD algorithm that satisfies FDP while leveraging desirable properties such…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Privacy-Preserving Technologies in Data

MethodsDiffusion