Provable Stochastic Optimization for Global Contrastive Learning: Small   Batch Does Not Harm Performance

Zhuoning Yuan; Yuexin Wu; Zi-Hao Qiu; Xianzhi Du; Lijun Zhang; Denny; Zhou; Tianbao Yang

arXiv:2202.12387·cs.LG·September 22, 2022·1 cites

Provable Stochastic Optimization for Global Contrastive Learning: Small Batch Does Not Harm Performance

Zhuoning Yuan, Yuexin Wu, Zi-Hao Qiu, Xianzhi Du, Lijun Zhang, Denny, Zhou, Tianbao Yang

PDF

Open Access 1 Repo

TL;DR

This paper introduces SogCLR, a memory-efficient stochastic optimization method for contrastive learning that achieves comparable performance to large-batch methods like SimCLR, enabling effective self-supervised learning with smaller batches.

Contribution

We propose SogCLR, a novel stochastic optimization algorithm that removes the large batch size requirement in contrastive learning, with theoretical guarantees and empirical validation.

Findings

01

SogCLR achieves similar performance to SimCLR with much smaller batch sizes.

02

The optimization error of SogCLR diminishes over iterations under reasonable conditions.

03

The method is applicable to various contrastive loss functions and is implemented in an open-source library.

Abstract

In this paper, we study contrastive learning from an optimization perspective, aiming to analyze and address a fundamental issue of existing contrastive learning methods that either rely on a large batch size or a large dictionary of feature vectors. We consider a global objective for contrastive learning, which contrasts each positive pair with all negative pairs for an anchor point. From the optimization perspective, we explain why existing methods such as SimCLR require a large batch size in order to achieve a satisfactory result. In order to remove such requirement, we propose a memory-efficient Stochastic Optimization algorithm for solving the Global objective of Contrastive Learning of Representations, named SogCLR. We show that its optimization error is negligible under a reasonable condition after a sufficient number of iterations or is diminishing for a slightly different…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

optimization-ai/sogclr
pytorch

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Machine Learning and ELM · MicroRNA in disease regulation

MethodsBitcoin Customer Service Number +1-833-534-1729 · *Communicated@Fast*How Do I Communicate to Expedia? · Contrastive Learning · Average Pooling · Residual Block · Batch Normalization · Global Average Pooling · Random Gaussian Blur · Max Pooling · 1x1 Convolution