Unlabeled Data Improves Adversarial Robustness

Yair Carmon; Aditi Raghunathan; Ludwig Schmidt; Percy Liang; John; C. Duchi

arXiv:1905.13736·stat.ML·January 14, 2022·73 cites

Unlabeled Data Improves Adversarial Robustness

Yair Carmon, Aditi Raghunathan, Ludwig Schmidt, Percy Liang, John, C. Duchi

PDF

Open Access 4 Repos

TL;DR

This paper shows that unlabeled data can significantly improve adversarial robustness through semisupervised learning, both theoretically by bridging sample complexity gaps and empirically by enhancing robustness on CIFAR-10 and SVHN datasets.

Contribution

It provides a theoretical proof that unlabeled data bridges the robustness gap and demonstrates empirically that semisupervised learning improves adversarial robustness on multiple datasets.

Findings

01

Unlabeled data bridges the sample complexity gap for robust classification.

02

Self-training with unlabeled data achieves high robust accuracy with fewer labels.

03

Augmenting datasets with unlabeled data improves robustness against strong attacks.

Abstract

We demonstrate, theoretically and empirically, that adversarial robustness can significantly benefit from semisupervised learning. Theoretically, we revisit the simple Gaussian model of Schmidt et al. that shows a sample complexity gap between standard and robust classification. We prove that unlabeled data bridges this gap: a simple semisupervised learning procedure (self-training) achieves high robust accuracy using the same number of labels required for achieving high standard accuracy. Empirically, we augment CIFAR-10 with 500K unlabeled images sourced from 80 Million Tiny Images and use robust self-training to outperform state-of-the-art robust accuracies by over 5 points in (i) $ℓ_{\infty}$ robustness against several strong attacks via adversarial training and (ii) certified $ℓ_{2}$ and $ℓ_{\infty}$ robustness via randomized smoothing. On SVHN, adding the dataset's own extra…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Advanced Neural Network Applications · Domain Adaptation and Few-Shot Learning