Are Labels Required for Improving Adversarial Robustness?

Jonathan Uesato; Jean-Baptiste Alayrac; Po-Sen Huang; Robert; Stanforth; Alhussein Fawzi; Pushmeet Kohli

arXiv:1905.13725·cs.LG·December 6, 2019·92 cites

Are Labels Required for Improving Adversarial Robustness?

Jonathan Uesato, Jean-Baptiste Alayrac, Po-Sen Huang, Robert, Stanforth, Alhussein Fawzi, Pushmeet Kohli

PDF

Open Access 1 Repo

TL;DR

This paper demonstrates that unlabeled data can effectively replace labeled data in adversarial training, significantly improving robustness and reducing the need for costly annotations.

Contribution

It introduces a theoretical and empirical framework showing unlabeled data can match supervised data in adversarial robustness, with practical improvements on CIFAR-10.

Findings

01

Unlabeled data improves robust accuracy by 21.7% on CIFAR-10.

02

Unsupervised adversarial training captures over 95% of supervised improvements.

03

Achieved a 4% improvement over previous state-of-the-art using unlabeled data.

Abstract

Recent work has uncovered the interesting (and somewhat surprising) finding that training models to be invariant to adversarial perturbations requires substantially larger datasets than those required for standard classification. This result is a key hurdle in the deployment of robust machine learning models in many real world applications where labeled data is expensive. Our main insight is that unlabeled data can be a competitive alternative to labeled data for training adversarially robust models. Theoretically, we show that in a simple statistical setting, the sample complexity for learning an adversarially robust model from unlabeled data matches the fully supervised case up to constant factors. On standard datasets like CIFAR-10, a simple Unsupervised Adversarial Training (UAT) approach using unlabeled data improves robust accuracy by 21.7% over using 4K supervised examples alone,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

deepmind/deepmind-research/tree/master/unsupervised_adversarial_training
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications · Integrated Circuits and Semiconductor Failure Analysis