Equivariant Differentially Private Deep Learning: Why DP-SGD Needs Sparser Models

Florian A. Hölzl; Daniel Rueckert; Georgios Kaissis

arXiv:2301.13104·cs.CV·June 22, 2023·1 cites

Open Access · 1 Repo

TL;DR

This paper shows that sparse equivariant convolutional networks improve both the accuracy and the efficiency of differentially private deep learning, lowering computational cost and improving the privacy-utility trade-off.

Contribution

Introducing equivariant convolutional networks for DP-SGD to create sparse, efficient models that outperform state-of-the-art architectures in privacy-preserving image classification.
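
To make the link between equivariance and parameter count concrete, here is a minimal NumPy sketch of a rotation-equivariant "lifting" convolution. It is an illustration only: the group of 90-degree rotations (C4), the function names `correlate_valid` and `lift_c4`, and the array sizes are assumptions made for this example and are not taken from the paper or its repository.

```python
import numpy as np
from numpy.lib.stride_tricks import sliding_window_view


def correlate_valid(image, kernel):
    """Plain 2D cross-correlation without padding ("valid" mode)."""
    windows = sliding_window_view(image, kernel.shape)  # (H', W', kh, kw)
    return np.einsum("ijab,ab->ij", windows, kernel)


def lift_c4(image, kernel):
    """Correlate the image with the kernel in its four 90-degree orientations.

    A single set of kernel weights yields four orientation channels, so the
    layer stores one kernel instead of four independently learned ones.
    """
    return np.stack([correlate_valid(image, np.rot90(kernel, r)) for r in range(4)])


rng = np.random.default_rng(0)
image = rng.normal(size=(8, 8))
kernel = rng.normal(size=(3, 3))

features = lift_c4(image, kernel)
features_of_rotated = lift_c4(np.rot90(image), kernel)

# Equivariance: rotating the input rotates every feature map and cyclically
# shifts the orientation axis by one step; nothing else changes.
expected = np.roll(np.rot90(features, axes=(1, 2)), shift=1, axis=0)
print(np.allclose(features_of_rotated, expected))
```

In this sketch the 9 weights of one 3x3 kernel are reused across all four orientations, which is the kind of weight sharing that lets an equivariant layer cover rotated patterns with fewer trainable parameters than an unconstrained layer with four separate kernels.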

Findings

1. Up to 9% accuracy improvement on CIFAR-10
2. Over 85% reduction in computation time
3. Sparse equivariant models outperform dense counterparts

Abstract

Differentially Private Stochastic Gradient Descent (DP-SGD) limits the amount of private information deep learning models can memorize during training. This is achieved by clipping and adding noise to the model's gradients, and thus networks with more parameters require proportionally stronger perturbation. As a result, large models have difficulties learning useful information, rendering training with DP-SGD exceedingly difficult on more challenging training tasks. Recent research has focused on combating this challenge through training adaptations such as heavy data augmentation and large batch sizes. However, these techniques further increase the computational overhead of DP-SGD and reduce its practical applicability. In this work, we propose using the principle of sparse model design to solve precisely such complex tasks with fewer parameters, higher accuracy, and in less time, thus…
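
The clip-then-add-noise mechanism described in the abstract can be sketched in a few lines of NumPy. This is a simplified illustration under stated assumptions, not the authors' implementation: the function names `dp_sgd_step` and `noise_to_signal`, the flat parameter vector, and the default values are all hypothetical, and privacy accounting is omitted entirely.

```python
import numpy as np


def dp_sgd_step(params, per_sample_grads, lr, clip_norm, noise_multiplier, rng):
    """One DP-SGD update on a flat parameter vector.

    per_sample_grads has shape (batch_size, num_params).
    """
    # Clip each per-sample gradient so its L2 norm is at most clip_norm.
    norms = np.linalg.norm(per_sample_grads, axis=1, keepdims=True)
    scale = np.minimum(1.0, clip_norm / np.maximum(norms, 1e-12))
    clipped = per_sample_grads * scale

    # Add Gaussian noise with standard deviation noise_multiplier * clip_norm
    # to the summed gradient, then average over the batch.
    noise = rng.normal(0.0, noise_multiplier * clip_norm, size=params.shape)
    noisy_mean = (clipped.sum(axis=0) + noise) / per_sample_grads.shape[0]
    return params - lr * noisy_mean


def noise_to_signal(num_params, batch_size=256, clip_norm=1.0,
                    noise_multiplier=1.0, seed=0):
    """Norm of the noise vector relative to the largest possible summed gradient.

    Clipping bounds the summed gradient norm by batch_size * clip_norm no matter
    how many parameters there are, while the noise norm grows with num_params.
    """
    rng = np.random.default_rng(seed)
    noise = rng.normal(0.0, noise_multiplier * clip_norm, size=num_params)
    return np.linalg.norm(noise) / (batch_size * clip_norm)


rng = np.random.default_rng(0)
params = np.zeros(2)
grads = np.array([[3.0, 4.0], [0.0, 0.5]])  # L2 norms 5.0 and 0.5
no_noise = dp_sgd_step(params, grads, lr=1.0, clip_norm=1.0,
                       noise_multiplier=0.0, rng=rng)
print(no_noise)

for d in (10_000, 1_000_000):
    print(d, noise_to_signal(d))
```

With the noise multiplier set to zero, the first gradient is scaled down to norm 1 and the second is left untouched, which isolates the clipping step. The second function illustrates why parameter count matters: the clipped signal is bounded independently of model size, whereas the norm of isotropic Gaussian noise grows roughly with the square root of the number of parameters, so a model with fewer parameters sees a more favorable ratio at the same privacy settings.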

Code & Models

Repositories

hlzl/equivariant (PyTorch, official)

Taxonomy

Topics: Privacy-Preserving Technologies in Data · Stochastic Gradient Optimization Techniques · Advanced Neural Network Applications