Feature Dropout: Revisiting the Role of Augmentations in Contrastive Learning
Alex Tamkin, Margalit Glasgow, Xiluo He, Noah Goodman

TL;DR
This paper investigates the role of label-destroying augmentations in contrastive learning, demonstrating their usefulness in learning diverse representations for multiple downstream tasks, supported by empirical experiments and theoretical analysis.
Contribution
It reveals that label-destroying augmentations can improve contrastive learning for foundation models, challenging the traditional view that augmentations should preserve labels.
Findings
Label-destroying augmentations can outperform expert-designed ones.
Augmentations alter features like shapes and digits, aiding diverse tasks.
Theoretical analysis shows their importance in multi-task learning.
Abstract
What role do augmentations play in contrastive learning? Recent work suggests that good augmentations are label-preserving with respect to a specific downstream task. We complicate this picture by showing that label-destroying augmentations can be useful in the foundation model setting, where the goal is to learn diverse, general-purpose representations for multiple downstream tasks. We perform contrastive learning experiments on a range of image and audio datasets with multiple downstream tasks (e.g. for digits superimposed on photographs, predicting the class of one vs. the other). We find that Viewmaker Networks, a recently proposed model for learning augmentations for contrastive learning, produce label-destroying augmentations that stochastically destroy features needed for different downstream tasks. These augmentations are interpretable (e.g. altering shapes, digits, or letters…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsMusic and Audio Processing · Speech and Audio Processing · Hearing Loss and Rehabilitation
