Equivariant Differentially Private Deep Learning: Why DP-SGD Needs Sparser Models
Florian A. H\"olzl, Daniel Rueckert, Georgios Kaissis

TL;DR
This paper demonstrates that using equivariant convolutional networks with sparse design significantly improves the efficiency and accuracy of differentially private deep learning, reducing computational costs and enhancing privacy-utility trade-offs.
Contribution
Introducing equivariant convolutional networks for DP-SGD to create sparse, efficient models that outperform state-of-the-art architectures in privacy-preserving image classification.
Findings
Up to 9% accuracy improvement on CIFAR-10
Over 85% reduction in computation time
Sparse equivariant models outperform dense counterparts
Abstract
Differentially Private Stochastic Gradient Descent (DP-SGD) limits the amount of private information deep learning models can memorize during training. This is achieved by clipping and adding noise to the model's gradients, and thus networks with more parameters require proportionally stronger perturbation. As a result, large models have difficulties learning useful information, rendering training with DP-SGD exceedingly difficult on more challenging training tasks. Recent research has focused on combating this challenge through training adaptations such as heavy data augmentation and large batch sizes. However, these techniques further increase the computational overhead of DP-SGD and reduce its practical applicability. In this work, we propose using the principle of sparse model design to solve precisely such complex tasks with fewer parameters, higher accuracy, and in less time, thus…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Stochastic Gradient Optimization Techniques · Advanced Neural Network Applications
