Revisiting the Critical Factors of Augmentation-Invariant Representation Learning
Junqiang Huang, Xiangwen Kong, Xiangyu Zhang

TL;DR
This paper investigates the factors influencing augmentation-invariant representation learning, comparing MoCo v2 and BYOL, and establishes a fair benchmark revealing how model configurations and optimization strategies affect transfer performance.
Contribution
It introduces the first fair benchmark for MoCo v2 and BYOL, analyzing how network asymmetry and configurations impact their effectiveness across tasks.
Findings
Sophisticated model configurations improve adaptation to datasets.
Mismatched optimization strategies hinder transfer performance.
Asymmetry benefits linear evaluation but may impair long-tailed classification.
Abstract
We focus on better understanding the critical factors of augmentation-invariant representation learning. We revisit MoCo v2 and BYOL and try to prove the authenticity of the following assumption: different frameworks bring about representations of different characteristics even with the same pretext task. We establish the first benchmark for fair comparisons between MoCo v2 and BYOL, and observe: (i) sophisticated model configurations enable better adaptation to pre-training dataset; (ii) mismatched optimization strategies of pre-training and fine-tuning hinder model from achieving competitive transfer performances. Given the fair benchmark, we make further investigation and find asymmetry of network structure endows contrastive frameworks to work well under the linear evaluation protocol, while may hurt the transfer performances on long-tailed classification tasks. Moreover, negative…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Topic Modeling · Advanced Neural Network Applications
MethodsDense Connections · Batch Normalization · Feedforward Network · Momentum Contrast · Bootstrap Your Own Latent · InfoNCE · Random Gaussian Blur · MoCo v2
