Unlabeled Data Improves Adversarial Robustness
Yair Carmon, Aditi Raghunathan, Ludwig Schmidt, Percy Liang, John, C. Duchi

TL;DR
This paper shows that unlabeled data can significantly improve adversarial robustness through semisupervised learning, both theoretically by bridging sample complexity gaps and empirically by enhancing robustness on CIFAR-10 and SVHN datasets.
Contribution
It provides a theoretical proof that unlabeled data bridges the robustness gap and demonstrates empirically that semisupervised learning improves adversarial robustness on multiple datasets.
Findings
Unlabeled data bridges the sample complexity gap for robust classification.
Self-training with unlabeled data achieves high robust accuracy with fewer labels.
Augmenting datasets with unlabeled data improves robustness against strong attacks.
Abstract
We demonstrate, theoretically and empirically, that adversarial robustness can significantly benefit from semisupervised learning. Theoretically, we revisit the simple Gaussian model of Schmidt et al. that shows a sample complexity gap between standard and robust classification. We prove that unlabeled data bridges this gap: a simple semisupervised learning procedure (self-training) achieves high robust accuracy using the same number of labels required for achieving high standard accuracy. Empirically, we augment CIFAR-10 with 500K unlabeled images sourced from 80 Million Tiny Images and use robust self-training to outperform state-of-the-art robust accuracies by over 5 points in (i) robustness against several strong attacks via adversarial training and (ii) certified and robustness via randomized smoothing. On SVHN, adding the dataset's own extra…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Advanced Neural Network Applications · Domain Adaptation and Few-Shot Learning
