TL;DR
This paper reveals that sampling noise and additive Gaussian noise in differentially private SGD have equivalent effects on utility, but are not equally accounted for in privacy budgets, leading to a new training paradigm that improves privacy-utility tradeoffs.
Contribution
The authors propose a paradigm shift in noise allocation in DP-SGD, favoring additive noise to better utilize the privacy budget and enhance model utility.
Findings
Equivalent impact of sampling and additive noise on utility.
Improved privacy/utility tradeoff in private CNNs.
Enhanced state-of-the-art performance in private learning.
Abstract
Learning often involves sensitive data and as such, privacy preserving extensions to Stochastic Gradient Descent (SGD) and other machine learning algorithms have been developed using the definitions of Differential Privacy (DP). In differentially private SGD, the gradients computed at each training iteration are subject to two different types of noise. Firstly, inherent sampling noise arising from the use of minibatches. Secondly, additive Gaussian noise from the underlying mechanisms that introduce privacy. In this study, we show that these two types of noise are equivalent in their effect on the utility of private neural networks, however they are not accounted for equally in the privacy budget. Given this observation, we propose a training paradigm that shifts the proportions of noise towards less inherent and more additive noise, such that more of the overall noise can be accounted…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsStochastic Gradient Descent
