Amplified Patch-Level Differential Privacy for Free via Random Cropping
Kaan Durmaz, Jan Schuchardt, Sebastian Schmidt, Stephan G\"unnemann

TL;DR
This paper reveals that random cropping in computer vision acts as an additional source of stochasticity in differentially private training, which amplifies privacy guarantees without altering the training process.
Contribution
The authors formalize the privacy amplification effect of random cropping at the patch level and derive tight bounds for DP-SGD, demonstrating improved privacy-utility trade-offs empirically.
Findings
Random cropping probabilistically excludes sensitive content, enhancing privacy.
Patch-level analysis quantifies the amplification effect.
Empirical results show improved privacy-utility trade-offs across datasets.
Abstract
Random cropping is one of the most common data augmentation techniques in computer vision, yet the role of its inherent randomness in training differentially private machine learning models has thus far gone unexplored. We observe that when sensitive content in an image is spatially localized, such as a face or license plate, random cropping can probabilistically exclude that content from the model's input. This introduces a third source of stochasticity in differentially private training with stochastic gradient descent, in addition to gradient noise and minibatch sampling. This additional randomness amplifies differential privacy without requiring changes to model architecture or training procedure. We formalize this effect by introducing a patch-level neighboring relation for vision data and deriving tight privacy bounds for differentially private stochastic gradient descent (DP-SGD)…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Adversarial Robustness in Machine Learning · Cryptography and Data Security
