Using Self-Supervised Learning Can Improve Model Robustness and   Uncertainty

Dan Hendrycks; Mantas Mazeika; Saurav Kadavath; Dawn Song

arXiv:1906.12340·cs.LG·October 30, 2019·343 cites

Using Self-Supervised Learning Can Improve Model Robustness and Uncertainty

Dan Hendrycks, Mantas Mazeika, Saurav Kadavath, Dawn Song

PDF

Open Access 4 Repos

TL;DR

This paper demonstrates that self-supervised learning significantly enhances model robustness against adversarial attacks, label noise, and input corruptions, and improves out-of-distribution detection, surpassing fully supervised methods.

Contribution

It reveals that self-supervised learning can improve robustness and uncertainty estimation, establishing these as key evaluation axes for future research.

Findings

01

Self-supervision improves robustness to adversarial examples.

02

Self-supervision enhances detection of out-of-distribution samples.

03

Self-supervised models outperform fully supervised ones in robustness metrics.

Abstract

Self-supervision provides effective representations for downstream tasks without requiring labels. However, existing approaches lag behind fully supervised training and are often not thought beneficial beyond obviating or reducing the need for annotations. We find that self-supervision can benefit robustness in a variety of ways, including robustness to adversarial examples, label corruption, and common input corruptions. Additionally, self-supervision greatly benefits out-of-distribution detection on difficult, near-distribution outliers, so much so that it exceeds the performance of fully supervised methods. These results demonstrate the promise of self-supervision for improving robustness and uncertainty estimation and establish these tasks as new axes of evaluation for future self-supervised learning research.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Machine Learning and Data Classification · Anomaly Detection Techniques and Applications

MethodsAverage Pooling · Residual Connection · *Communicated@Fast*How Do I Communicate to Expedia? · 1x1 Convolution · Batch Normalization · Bottleneck Residual Block · Global Average Pooling · Residual Block · Kaiming Initialization · Max Pooling