Making Self-supervised Learning Robust to Spurious Correlation via Learning-speed Aware Sampling

Weicheng Zhu; Sheng Liu; Carlos Fernandez-Granda; Narges Razavian

arXiv:2311.16361·cs.LG·December 1, 2023·1 cites

TL;DR

This paper introduces a learning-speed aware sampling method for self-supervised learning (SSL). It improves robustness to spurious correlations by prioritizing samples that the model learns more slowly, which leads to better downstream task performance.

Contribution

It proposes a novel sampling strategy based on learning dynamics to mitigate the impact of spurious correlations in SSL, improving representation robustness.
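
The idea of sampling by learning speed can be illustrated with a minimal sketch. It is not the authors' implementation: the speed proxy (average per-epoch drop in a per-sample loss), the softmax weighting, the `temperature` parameter, and all function names are assumptions chosen for illustration, since this page does not state the exact formulation.

```python
import math
import random


def learning_speed(loss_history):
    """Average per-epoch drop in loss for one sample (hypothetical proxy)."""
    if len(loss_history) < 2:
        return 0.0
    return (loss_history[0] - loss_history[-1]) / (len(loss_history) - 1)


def sampling_weights(loss_histories, temperature=1.0):
    """Softmax over negative speed: slower-learned samples get larger weights."""
    speeds = [learning_speed(h) for h in loss_histories]
    scores = [-s / temperature for s in speeds]
    max_score = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - max_score) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def draw_batch(loss_histories, batch_size, temperature=1.0, seed=None):
    """Sample indices with replacement according to the weights above."""
    rng = random.Random(seed)
    weights = sampling_weights(loss_histories, temperature)
    return rng.choices(range(len(loss_histories)), weights=weights, k=batch_size)


# Toy per-sample SSL losses over three epochs (made-up numbers).
histories = [
    [2.0, 1.0, 0.5],  # loss drops quickly: fast-learned sample
    [2.0, 1.9, 1.8],  # loss barely moves: slow-learned sample
]
weights = sampling_weights(histories, temperature=0.5)
batch = draw_batch(histories, batch_size=8, temperature=0.5, seed=0)
```

In this toy setup the second sample receives the larger weight, so it appears more often in drawn batches. A lower `temperature` sharpens that preference, and a higher one moves the sampler toward uniform.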

Findings

01. LA-SSL (learning-speed aware SSL) improves downstream classification accuracy.

02. The method reduces reliance on spurious correlations.

03. Robustness improves across multiple datasets.

Abstract

Self-supervised learning (SSL) has emerged as a powerful technique for learning rich representations from unlabeled data. The data representations are able to capture many underlying attributes of data, and be useful in downstream prediction tasks. In real-world settings, spurious correlations between some attributes (e.g. race, gender and age) and labels for downstream tasks often exist, e.g. cancer is usually more prevalent among elderly patients. In this paper, we investigate SSL in the presence of spurious correlations and show that the SSL training loss can be minimized by capturing only a subset of the conspicuous features relevant to those sensitive attributes, despite the presence of other important predictive features for the downstream tasks. To address this issue, we investigate the learning dynamics of SSL and observe that the learning is slower for samples that conflict…

Taxonomy

Topics: Domain Adaptation and Few-Shot Learning · Machine Learning in Healthcare

Methods: Attentive Walk-Aggregating Graph Neural Network