Deep Augmentation: Dropout as Augmentation for Self-Supervised Learning
Rickard Br\"uel-Gabrielsson, Tongzhou Wang, Manel Baradad, Justin Solomon

TL;DR
This paper investigates the use of dropout as a data augmentation technique in self-supervised learning, proposing Deep Augmentation which applies dropout or PCA at specific network layers to improve performance across various modalities.
Contribution
It introduces Deep Augmentation, a modality- and network-agnostic method applying dropout or PCA at targeted layers, with insights on when and how dropout enhances self-supervised learning.
Findings
Dropout is most effective in deeper layers for contrastive learning.
A stop-gradient operation is crucial for dropout's effectiveness as augmentation.
Deep Augmentation can outperform traditional input-level augmentations.
Abstract
Despite dropout's ubiquity in machine learning, its effectiveness as a form of data augmentation remains under-explored. We address two key questions: (i) When is dropout effective as an augmentation strategy? (ii) Is dropout uniquely effective under these conditions? To explore these questions, we propose Deep Augmentation, a network- and modality-agnostic method that applies dropout or PCA transformations to targeted layers in neural networks. Through extensive experiments on contrastive learning tasks in NLP, computer vision, and graph learning, we find that uniformly applying dropout across layers does not consistently improve performance. Instead, dropout proves most beneficial in deeper layers and can be matched by alternative augmentations (e.g., PCA). We also show that a stop-gradient operation is critical for ensuring dropout functions effectively as an augmentation, and that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsOnline Learning and Analytics
MethodsPrincipal Components Analysis · Dropout · Contrastive Learning
