Not all noise is accounted equally: How differentially private learning   benefits from large sampling rates

Friedrich D\"ormann; Osvald Frisk; Lars N{\o}rvang Andersen; Christian; Fischer Pedersen

arXiv:2110.06255·cs.LG·October 14, 2021

Not all noise is accounted equally: How differentially private learning benefits from large sampling rates

Friedrich D\"ormann, Osvald Frisk, Lars N{\o}rvang Andersen, Christian, Fischer Pedersen

PDF

1 Repo

TL;DR

This paper reveals that sampling noise and additive Gaussian noise in differentially private SGD have equivalent effects on utility, but are not equally accounted for in privacy budgets, leading to a new training paradigm that improves privacy-utility tradeoffs.

Contribution

The authors propose a paradigm shift in noise allocation in DP-SGD, favoring additive noise to better utilize the privacy budget and enhance model utility.

Findings

01

Equivalent impact of sampling and additive noise on utility.

02

Improved privacy/utility tradeoff in private CNNs.

03

Enhanced state-of-the-art performance in private learning.

Abstract

Learning often involves sensitive data and as such, privacy preserving extensions to Stochastic Gradient Descent (SGD) and other machine learning algorithms have been developed using the definitions of Differential Privacy (DP). In differentially private SGD, the gradients computed at each training iteration are subject to two different types of noise. Firstly, inherent sampling noise arising from the use of minibatches. Secondly, additive Gaussian noise from the underlying mechanisms that introduce privacy. In this study, we show that these two types of noise are equivalent in their effect on the utility of private neural networks, however they are not accounted for equally in the privacy budget. Given this observation, we propose a training paradigm that shifts the proportions of noise towards less inherent and more additive noise, such that more of the overall noise can be accounted…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

osvaldfrisk/dp-not-all-noise-is-equal
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsStochastic Gradient Descent