Improved Baselines with Momentum Contrastive Learning
Xinlei Chen, Haoqi Fan, Ross Girshick, Kaiming He

TL;DR
This paper enhances the MoCo framework for contrastive unsupervised learning by incorporating design improvements from SimCLR, such as an MLP projection head and increased data augmentation, leading to stronger, more accessible baselines.
Contribution
It demonstrates that simple modifications to MoCo can outperform SimCLR without large batch sizes, improving baseline performance in contrastive learning.
Findings
Stronger MoCo baselines surpass SimCLR performance.
No need for large training batches with modifications.
Enhanced unsupervised learning accessibility.
Abstract
Contrastive unsupervised learning has recently shown encouraging progress, e.g., in Momentum Contrast (MoCo) and SimCLR. In this note, we verify the effectiveness of two of SimCLR's design improvements by implementing them in the MoCo framework. With simple modifications to MoCo---namely, using an MLP projection head and more data augmentation---we establish stronger baselines that outperform SimCLR and do not require large training batches. We hope this will make state-of-the-art unsupervised learning research more accessible. Code will be made public.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Speech Recognition and Synthesis · Domain Adaptation and Few-Shot Learning
MethodsDense Connections · InfoNCE · Random Gaussian Blur · Feedforward Network · SGD with Momentum · Random Horizontal Flip · Random Resized Crop · Cosine Annealing · MoCo v2 · Momentum Contrast
