Decoupled Contrastive Learning
Chun-Hsiao Yeh, Cheng-Yao Hong, Yen-Chi Hsu, Tyng-Luh Liu, Yubei Chen, and Yann LeCun

TL;DR
Decoupled Contrastive Learning (DCL) improves efficiency and robustness of self-supervised learning by removing the negative-positive coupling in the InfoNCE loss, achieving state-of-the-art results with less computational resources.
Contribution
The paper introduces DCL, a novel contrastive loss that removes the positive term from the denominator, enhancing learning efficiency and robustness over traditional methods.
Findings
DCL outperforms baseline methods on ImageNet-1K with fewer epochs.
DCL achieves 68.2% top-1 accuracy with batch size 256 in 200 epochs.
Combining DCL with NNCLR sets a new state-of-the-art at 72.3% accuracy.
Abstract
Contrastive learning (CL) is one of the most successful paradigms for self-supervised learning (SSL). In a principled way, it considers two augmented "views" of the same image as positive to be pulled closer, and all other images as negative to be pushed further apart. However, behind the impressive success of CL-based techniques, their formulation often relies on heavy-computation settings, including large sample batches, extensive training epochs, etc. We are thus motivated to tackle these issues and establish a simple, efficient, yet competitive baseline of contrastive learning. Specifically, we identify, from theoretical and empirical studies, a noticeable negative-positive-coupling (NPC) effect in the widely used InfoNCE loss, leading to unsuitable learning efficiency concerning the batch size. By removing the NPC effect, we propose decoupled contrastive learning (DCL) loss, which…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Geophysical Methods and Applications · Seismic Imaging and Inversion Techniques
MethodsBitcoin Customer Service Number +1-833-534-1729 · Nearest-Neighbor Contrastive Learning of Visual Representations · Contrastive Learning · 1x1 Convolution · Convolution · Residual Connection · Average Pooling · Bottleneck Residual Block · Dense Connections · Kaiming Initialization
