Understanding and Mitigating the Tradeoff Between Robustness and Accuracy
Aditi Raghunathan, Sang Michael Xie, Fanny Yang, John Duchi, Percy, Liang

TL;DR
This paper analyzes the tradeoff between robustness and accuracy in adversarial training, providing theoretical insights in linear regression and empirical evidence that robust self-training can improve both errors in neural networks.
Contribution
It offers a precise characterization of the robustness-accuracy tradeoff in linear regression and demonstrates that robust self-training enhances both errors in neural networks.
Findings
Standard error can increase even with noiseless optimal predictors.
Robust self-training improves both standard and robust errors in neural networks.
Empirical results show improvements on CIFAR-10 with various adversarial methods.
Abstract
Adversarial training augments the training set with perturbations to improve the robust error (over worst-case perturbations), but it often leads to an increase in the standard error (on unperturbed test inputs). Previous explanations for this tradeoff rely on the assumption that no predictor in the hypothesis class has low standard and robust error. In this work, we precisely characterize the effect of augmentation on the standard error in linear regression when the optimal linear predictor has zero standard and robust error. In particular, we show that the standard error could increase even when the augmented perturbations have noiseless observations from the optimal linear predictor. We then prove that the recently proposed robust self-training (RST) estimator improves robust error without sacrificing standard error for noiseless linear regression. Empirically, for neural networks,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications · Domain Adaptation and Few-Shot Learning
MethodsTest · Linear Regression
