Provable Stochastic Optimization for Global Contrastive Learning: Small Batch Does Not Harm Performance
Zhuoning Yuan, Yuexin Wu, Zi-Hao Qiu, Xianzhi Du, Lijun Zhang, Denny, Zhou, Tianbao Yang

TL;DR
This paper introduces SogCLR, a memory-efficient stochastic optimization method for contrastive learning that achieves comparable performance to large-batch methods like SimCLR, enabling effective self-supervised learning with smaller batches.
Contribution
We propose SogCLR, a novel stochastic optimization algorithm that removes the large batch size requirement in contrastive learning, with theoretical guarantees and empirical validation.
Findings
SogCLR achieves similar performance to SimCLR with much smaller batch sizes.
The optimization error of SogCLR diminishes over iterations under reasonable conditions.
The method is applicable to various contrastive loss functions and is implemented in an open-source library.
Abstract
In this paper, we study contrastive learning from an optimization perspective, aiming to analyze and address a fundamental issue of existing contrastive learning methods that either rely on a large batch size or a large dictionary of feature vectors. We consider a global objective for contrastive learning, which contrasts each positive pair with all negative pairs for an anchor point. From the optimization perspective, we explain why existing methods such as SimCLR require a large batch size in order to achieve a satisfactory result. In order to remove such requirement, we propose a memory-efficient Stochastic Optimization algorithm for solving the Global objective of Contrastive Learning of Representations, named SogCLR. We show that its optimization error is negligible under a reasonable condition after a sufficient number of iterations or is diminishing for a slightly different…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Machine Learning and ELM · MicroRNA in disease regulation
MethodsBitcoin Customer Service Number +1-833-534-1729 · *Communicated@Fast*How Do I Communicate to Expedia? · Contrastive Learning · Average Pooling · Residual Block · Batch Normalization · Global Average Pooling · Random Gaussian Blur · Max Pooling · 1x1 Convolution
