Rethinking Evaluation Protocols of Visual Representations Learned via Self-supervised Learning
Jae-Hun Lee, Doyoung Yoon, ByeongMoon Ji, Kyungyul Kim, Sangheum Hwang

TL;DR
This paper critically examines the evaluation protocols for self-supervised visual representations, revealing that hyperparameter sensitivity affects performance assessments and proposing normalization techniques to improve robustness.
Contribution
It identifies key factors influencing SSL evaluation sensitivity and suggests improved normalization methods to make evaluations more reliable and representative of true representation quality.
Findings
Input normalization is crucial for stable LP performance.
Batch normalization before classifiers improves evaluation robustness.
SSL transferability is significantly affected by weight decay settings.
Abstract
Linear probing (LP) (and -NN) on the upstream dataset with labels (e.g., ImageNet) and transfer learning (TL) to various downstream datasets are commonly employed to evaluate the quality of visual representations learned via self-supervised learning (SSL). Although existing SSL methods have shown good performances under those evaluation protocols, we observe that the performances are very sensitive to the hyperparameters involved in LP and TL. We argue that this is an undesirable behavior since truly generic representations should be easily adapted to any other visual recognition task, i.e., the learned representations should be robust to the settings of LP and TL hyperparameters. In this work, we try to figure out the cause of performance sensitivity by conducting extensive experiments with state-of-the-art SSL methods. First, we find that input normalization for LP is crucial to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Image Processing Techniques and Applications · Multimodal Machine Learning Applications
MethodsWeight Decay · Batch Normalization
