Toward Training at ImageNet Scale with Differential Privacy
Alexey Kurakin, Shuang Song, Steve Chien, Roxana Geambasu, Andreas, Terzis, Abhradeep Thakurta

TL;DR
This paper explores methods to train large neural networks with differential privacy on ImageNet, achieving a new baseline accuracy of 47.9% with privacy guarantees, and shares insights and code for future research.
Contribution
It introduces practical approaches and training settings that improve DP training speed and accuracy on large-scale image classification tasks.
Findings
Achieved 47.9% accuracy on ImageNet with DP
Identified training strategies that enhance DP training efficiency
Provided open-source code for reproducibility and further research
Abstract
Differential privacy (DP) is the de facto standard for training machine learning (ML) models, including neural networks, while ensuring the privacy of individual examples in the training set. Despite a rich literature on how to train ML models with differential privacy, it remains extremely challenging to train real-life, large neural networks with both reasonable accuracy and privacy. We set out to investigate how to do this, using ImageNet image classification as a poster example of an ML task that is very challenging to resolve accurately with DP right now. This paper shares initial lessons from our effort, in the hope that it will inspire and inform other researchers to explore DP training at scale. We show approaches that help make DP training faster, as well as model types and settings of the training process that tend to work better in the DP setting. Combined, the methods we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data
