Scalable DP-SGD: Shuffling vs. Poisson Subsampling
Lynn Chua, Badih Ghazi, Pritish Kamath, Ravi Kumar, Pasin Manurangsi,, Amer Sinha, Chiyuan Zhang

TL;DR
This paper investigates the privacy guarantees of shuffling-based DP-SGD compared to Poisson subsampling, revealing significant gaps and proposing scalable methods for Poisson subsampling to improve model utility.
Contribution
It provides new lower bounds on privacy for shuffled batch sampling, questions current practices, and introduces scalable Poisson subsampling techniques for better utility.
Findings
Shuffling-based DP-SGD has weaker privacy guarantees than previously thought.
Poisson subsampling can be implemented efficiently at scale.
Models trained with Poisson subsampling show improved utility.
Abstract
We provide new lower bounds on the privacy guarantee of the multi-epoch Adaptive Batch Linear Queries (ABLQ) mechanism with shuffled batch sampling, demonstrating substantial gaps when compared to Poisson subsampling; prior analysis was limited to a single epoch. Since the privacy analysis of Differentially Private Stochastic Gradient Descent (DP-SGD) is obtained by analyzing the ABLQ mechanism, this brings into serious question the common practice of implementing shuffling-based DP-SGD, but reporting privacy parameters as if Poisson subsampling was used. To understand the impact of this gap on the utility of trained machine learning models, we introduce a practical approach to implement Poisson subsampling at scale using massively parallel computation, and efficiently train models with the same. We compare the utility of models trained with Poisson-subsampling-based DP-SGD, and the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsMedical Image Segmentation Techniques · Computer Graphics and Visualization Techniques · 3D Shape Modeling and Analysis
