TL;DR
This paper introduces Balanced Iteration Subsampling (BIS), a structured sampling scheme for DP-SGD that reduces participation variance and achieves stronger privacy amplification than Poisson subsampling, especially in low-noise regimes.
Contribution
The authors propose BIS, a novel structured subsampling scheme that outperforms Poisson sampling in privacy amplification and provide a practical Monte Carlo accountant for it.
Findings
BIS achieves stronger privacy amplification than Poisson subsampling.
BIS reduces the required noise multiplier by up to 9.6%.
Structured participation in sampling improves privacy and practicality.
Abstract
Poisson subsampling is the default sampling scheme in differentially private machine learning, largely because its unstructured randomness yields tractable privacy amplification analyses. Yet this same randomness introduces substantial participation variance: each sample appears in very different numbers of training iterations. In this work, we show that this variance is not merely a practical artifact to be tolerated, but a fundamental source of suboptimal privacy amplification. We prove that Balanced Iteration Subsampling (BIS), a structured scheme in which each sample participates in exactly a fixed number of iterations, achieves stronger privacy amplification than Poisson subsampling and is optimal at both extremes of the noise spectrum ( and ). Our analysis reveals that the privacy-noise tradeoff is governed not by maximizing randomness, but by…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
