Toward Training at ImageNet Scale with Differential Privacy

Alexey Kurakin, Shuang Song, Steve Chien, Roxana Geambasu, Andreas Terzis, Abhradeep Thakurta

arXiv:2201.12328·cs.LG·February 10, 2022·21 cites

TL;DR

This paper explores methods to train large neural networks with differential privacy on ImageNet, achieving a new baseline accuracy of 47.9% with privacy guarantees, and shares insights and code for future research.

Contribution

It introduces practical approaches and training settings that improve DP training speed and accuracy on large-scale image classification tasks.

Findings

1. Achieved 47.9% accuracy on ImageNet with DP
2. Identified training strategies that enhance DP training efficiency
3. Provided open-source code for reproducibility and further research

Abstract

Differential privacy (DP) is the de facto standard for training machine learning (ML) models, including neural networks, while ensuring the privacy of individual examples in the training set. Despite a rich literature on how to train ML models with differential privacy, it remains extremely challenging to train real-life, large neural networks with both reasonable accuracy and privacy. We set out to investigate how to do this, using ImageNet image classification as a poster example of an ML task that is very challenging to resolve accurately with DP right now. This paper shares initial lessons from our effort, in the hope that it will inspire and inform other researchers to explore DP training at scale. We show approaches that help make DP training faster, as well as model types and settings of the training process that tend to work better in the DP setting. Combined, the methods we…
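The abstract does not spell out the training algorithm. A common way to train a model with DP is DP-SGD: clip each per-example gradient to a fixed L2 norm, sum the clipped gradients, add Gaussian noise scaled to the clipping norm, and then average. The block below is a minimal sketch of one such step on a toy linear-regression loss, written in plain Python for readability. It is not the authors' implementation; the function names, the loss, and the hyperparameter values are all illustrative assumptions, and it does not compute the resulting privacy parameters, which would need a separate privacy accountant.

```python
import math
import random


def clip_by_l2_norm(grad, clip_norm):
    """Scale grad so that its L2 norm is at most clip_norm."""
    norm = math.sqrt(sum(g * g for g in grad))
    scale = min(1.0, clip_norm / norm) if norm > 0 else 1.0
    return [g * scale for g in grad]


def per_example_grad(w, x, y):
    """Gradient of 0.5 * (w.x - y)^2 with respect to w for one example (toy loss)."""
    err = sum(wi * xi for wi, xi in zip(w, x)) - y
    return [err * xi for xi in x]


def dp_sgd_step(w, batch, lr, clip_norm, noise_multiplier, rng):
    """One DP-SGD step: clip per-example gradients, sum, add Gaussian noise, average."""
    summed = [0.0] * len(w)
    for x, y in batch:
        clipped = clip_by_l2_norm(per_example_grad(w, x, y), clip_norm)
        summed = [s + c for s, c in zip(summed, clipped)]
    # Noise standard deviation is proportional to the clipping norm.
    sigma = noise_multiplier * clip_norm
    noisy = [s + rng.gauss(0.0, sigma) for s in summed]
    return [wi - lr * n / len(batch) for wi, n in zip(w, noisy)]


if __name__ == "__main__":
    rng = random.Random(0)  # fixed seed only so the toy run is repeatable
    batch = [([1.0, 2.0], 3.0), ([0.5, -1.0], 0.0)]
    w = dp_sgd_step([0.0, 0.0], batch, lr=0.1, clip_norm=1.0,
                    noise_multiplier=1.1, rng=rng)
    print(w)
```

Clipping bounds how much any single example can move the summed gradient, which is what lets the added noise mask each example's contribution. The per-example gradient computation is also the main reason DP training tends to be slower than ordinary training.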

Code & Models

Repositories

google-research/dp-imagenet (JAX, official)

Taxonomy

Topics: Privacy-Preserving Technologies in Data