Characterizing and Improving the Robustness of Self-Supervised Learning through Background Augmentations
Chaitanya K. Ryali, David J. Schwab, Ari S. Morcos

TL;DR
This paper introduces background augmentations in self-supervised learning to focus models on relevant image content, leading to improved accuracy and robustness across multiple tasks and settings.
Contribution
It proposes simple background augmentation techniques that enhance self-supervised learning by reducing reliance on background cues, achieving state-of-the-art performance.
Findings
+1-2% on ImageNet accuracy
Up to 4.2% improvement in limited-label settings
Enhanced robustness to distribution shifts
Abstract
Recent progress in self-supervised learning has demonstrated promising results in multiple visual tasks. An important ingredient in high-performing self-supervised methods is the use of data augmentation by training models to place different augmented views of the same image nearby in embedding space. However, commonly used augmentation pipelines treat images holistically, ignoring the semantic relevance of parts of an image-e.g. a subject vs. a background-which can lead to the learning of spurious correlations. Our work addresses this problem by investigating a class of simple, yet highly effective "background augmentations", which encourage models to focus on semantically-relevant content by discouraging them from focusing on image backgrounds. Through a systematic investigation, we show that background augmentations lead to substantial improvements in performance across a spectrum of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Advanced Neural Network Applications · Adversarial Robustness in Machine Learning
MethodsBootstrap Your Own Latent
