Differentially Private Optimizers Can Learn Adversarially Robust Models
Yuan Zhang, Zhiqi Bu

TL;DR
This paper demonstrates that differentially private training can produce models that are both accurate and adversarially robust, sometimes outperforming non-private models, challenging prior assumptions about privacy-robustness tradeoffs.
Contribution
First theoretical and empirical analysis showing DP models can be robust and accurate, highlighting key factors influencing the privacy-robustness-accuracy tradeoff.
Findings
DP models can be more robust than non-private models.
Proper hyper-parameters and pre-training improve robustness.
DP models are Pareto optimal on the accuracy-robustness tradeoff.
Abstract
Machine learning models have shone in a variety of domains and attracted increasing attention from both the security and the privacy communities. One important yet worrying question is: Will training models under the differential privacy (DP) constraint have an unfavorable impact on their adversarial robustness? While previous works have postulated that privacy comes at the cost of worse robustness, we give the first theoretical analysis to show that DP models can indeed be robust and accurate, even sometimes more robust than their naturally-trained non-private counterparts. We observe three key factors that influence the privacy-robustness-accuracy tradeoff: (1) hyper-parameters for DP optimizers are critical; (2) pre-training on public data significantly mitigates the accuracy and robustness drop; (3) choice of DP optimizers makes a difference. With these factors set properly, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Privacy-Preserving Technologies in Data · Advanced Neural Network Applications
MethodsMulti-Head Attention · Attention Is All You Need · Position-Wise Feed-Forward Layer · Label Smoothing · Average Pooling · 1x1 Convolution · Kaiming Initialization · Linear Layer · Softmax · Batch Normalization
