Revisiting the Critical Factors of Augmentation-Invariant Representation   Learning

Junqiang Huang; Xiangwen Kong; Xiangyu Zhang

arXiv:2208.00275·cs.CV·August 2, 2022

Revisiting the Critical Factors of Augmentation-Invariant Representation Learning

Junqiang Huang, Xiangwen Kong, Xiangyu Zhang

PDF

Open Access 1 Repo

TL;DR

This paper investigates the factors influencing augmentation-invariant representation learning, comparing MoCo v2 and BYOL, and establishes a fair benchmark revealing how model configurations and optimization strategies affect transfer performance.

Contribution

It introduces the first fair benchmark for MoCo v2 and BYOL, analyzing how network asymmetry and configurations impact their effectiveness across tasks.

Findings

01

Sophisticated model configurations improve adaptation to datasets.

02

Mismatched optimization strategies hinder transfer performance.

03

Asymmetry benefits linear evaluation but may impair long-tailed classification.

Abstract

We focus on better understanding the critical factors of augmentation-invariant representation learning. We revisit MoCo v2 and BYOL and try to prove the authenticity of the following assumption: different frameworks bring about representations of different characteristics even with the same pretext task. We establish the first benchmark for fair comparisons between MoCo v2 and BYOL, and observe: (i) sophisticated model configurations enable better adaptation to pre-training dataset; (ii) mismatched optimization strategies of pre-training and fine-tuning hinder model from achieving competitive transfer performances. Given the fair benchmark, we make further investigation and find asymmetry of network structure endows contrastive frameworks to work well under the linear evaluation protocol, while may hurt the transfer performances on long-tailed classification tasks. Moreover, negative…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

megvii-research/revisitairl
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Topic Modeling · Advanced Neural Network Applications

MethodsDense Connections · Batch Normalization · Feedforward Network · Momentum Contrast · Bootstrap Your Own Latent · InfoNCE · Random Gaussian Blur · MoCo v2