On the Importance of Hyperparameters and Data Augmentation for Self-Supervised Learning
Diane Wagner, Fabio Ferreira, Danny Stoll, Robin Tibor Schirrmeister,, Samuel M\"uller, Frank Hutter

TL;DR
This paper demonstrates that hyperparameters and data augmentation strategies critically influence self-supervised learning performance, introduces GroupAugment for automated augmentation, and emphasizes their underestimated importance in SSL.
Contribution
The paper hyperparameterizes and optimizes hyperparameters and data augmentations for SSL, and introduces GroupAugment, an automated augmentation algorithm tailored for SSL.
Findings
Hyperparameters and data augmentation significantly affect SSL performance.
Bayesian optimization improves SSL results across datasets.
GroupAugment achieves high accuracy consistently across datasets.
Abstract
Self-Supervised Learning (SSL) has become a very active area of Deep Learning research where it is heavily used as a pre-training method for classification and other tasks. However, the rapid pace of advancements in this area comes at a price: training pipelines vary significantly across papers, which presents a potentially crucial confounding factor. Here, we show that, indeed, the choice of hyperparameters and data augmentation strategies can have a dramatic impact on performance. To shed light on these neglected factors and help maximize the power of SSL, we hyperparameterize these components and optimize them with Bayesian optimization, showing improvements across multiple datasets for the SimSiam SSL approach. Realizing the importance of data augmentations for SSL, we also introduce a new automated data augmentation algorithm, GroupAugment, which considers groups of augmentations…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification · Domain Adaptation and Few-Shot Learning · Anomaly Detection Techniques and Applications
