How Robust is Unsupervised Representation Learning to Distribution   Shift?

Yuge Shi; Imant Daunhawer; Julia E. Vogt; Philip H.S. Torr; Amartya; Sanyal

arXiv:2206.08871·cs.LG·December 19, 2022·6 cites

How Robust is Unsupervised Representation Learning to Distribution Shift?

Yuge Shi, Imant Daunhawer, Julia E. Vogt, Philip H.S. Torr, Amartya, Sanyal

PDF

Open Access

TL;DR

This paper investigates how unsupervised representation learning methods like SSL and AE are more robust to distribution shifts than supervised learning, through extensive experiments on synthetic and real datasets.

Contribution

It provides the first comprehensive evaluation of unsupervised methods' robustness to distribution shift, introducing controllable datasets and a linear head evaluation approach.

Findings

01

Unsupervised representations outperform supervised ones under distribution shift.

02

Representations learned from SSL and AE generalize better than supervised models.

03

The study introduces controllable datasets for realistic distribution shift evaluation.

Abstract

The robustness of machine learning algorithms to distributions shift is primarily discussed in the context of supervised learning (SL). As such, there is a lack of insight on the robustness of the representations learned from unsupervised methods, such as self-supervised learning (SSL) and auto-encoder based algorithms (AE), to distribution shift. We posit that the input-driven objectives of unsupervised algorithms lead to representations that are more robust to distribution shift than the target-driven objective of SL. We verify this by extensively evaluating the performance of SSL and AE on both synthetic and realistic distribution shift datasets. Following observations that the linear layer used for classification itself can be susceptible to spurious correlations, we evaluate the representations using a linear head trained on a small amount of out-of-distribution (OOD) data, to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Machine Learning and Data Classification · Machine Learning in Healthcare

MethodsLinear Layer · Autoencoders