Rethinking the Security of DP-SGD: A Corrected Analysis of Differentially Private Machine Learning
Wenhao Wang, Shujie Cui, Hui Cui, Xingliang Yuan

TL;DR
This paper revisits the privacy analysis of DP-SGD, revealing that common normalization steps in implementations can weaken privacy guarantees and lead to underestimated privacy leakage.
Contribution
It introduces a corrected analysis framework for DP-SGD that accounts for normalization steps, and empirically demonstrates potential privacy leaks in existing implementations.
Findings
Standard SGM analysis can overestimate privacy guarantees.
Normalization steps in implementations can weaken privacy guarantees.
Empirical audits show leakage beyond standard guarantees.
Abstract
Differentially Private Stochastic Gradient Descent (DP-SGD) is widely used to protect training data in machine learning. Its privacy guarantee is commonly analyzed through a security game in which an adversary infers whether a target record is included in the training dataset from the mechanism output. The resulting privacy leakage is characterized by a privacy curve, which reports the false negative rate as a function of the false positive rate. We identify a mismatch between this formal analysis and common DP-SGD implementations. Existing analyses often model DP-SGD and its variants as the Subsampled Gaussian Mechanism (SGM), where Gaussian noise is added to the sum of clipped gradients computed from a Poisson-sampled batch. In practice, however, many implementations apply an additional normalization step: the noisy gradient sum is divided either by the expected batch size or by the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
