Fast-MoCo: Boost Momentum-based Contrastive Learning with Combinatorial Patches
Yuanzheng Ci, Chen Lin, Lei Bai, Wanli Ouyang

TL;DR
Fast-MoCo introduces a combinatorial patch strategy to generate multiple positive pairs from two augmented views, significantly reducing training epochs needed for effective contrastive self-supervised learning.
Contribution
It proposes a novel combinatorial patches method to enhance positive pair generation, accelerating training without extra computational cost.
Findings
Achieves 73.5% accuracy in 100 epochs, comparable to MoCo v3 trained for 800 epochs.
Further training to 300 epochs yields 75.1% accuracy, matching state-of-the-art performance.
Effective across multiple downstream tasks.
Abstract
Contrastive-based self-supervised learning methods achieved great success in recent years. However, self-supervision requires extremely long training epochs (e.g., 800 epochs for MoCo v3) to achieve promising results, which is unacceptable for the general academic community and hinders the development of this topic. This work revisits the momentum-based contrastive learning frameworks and identifies the inefficiency in which two augmented views generate only one positive pair. We propose Fast-MoCo - a novel framework that utilizes combinatorial patches to construct multiple positive pairs from two augmented views, which provides abundant supervision signals that bring significant acceleration with neglectable extra computational cost. Fast-MoCo trained with 100 epochs achieves 73.5% linear evaluation accuracy, similar to MoCo v3 (ResNet-50 backbone) trained with 800 epochs. Extra…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage Enhancement Techniques · Video Surveillance and Tracking Methods · Domain Adaptation and Few-Shot Learning
MethodsContrastive Learning · Batch Normalization · MoCo v3 · InfoNCE · Momentum Contrast
