Global Contrastive Batch Sampling via Optimization on Sample Permutations
Vin Sachidananda, Ziyi Yang, Chenguang Zhu

TL;DR
This paper introduces Global Contrastive Batch Sampling (GCBS), an efficient method for batch construction in contrastive learning that improves performance without the drawbacks of hard negative mining.
Contribution
The paper proposes GCBS, a novel approximation method for batch sampling in contrastive learning that reduces computational costs and enhances performance in sentence embedding and code-search tasks.
Findings
GCBS improves state-of-the-art results in sentence embedding.
GCBS enhances performance in code-search tasks.
GCBS is computationally more efficient than traditional hard negative mining.
Abstract
Contrastive Learning has recently achieved state-of-the-art performance in a wide range of tasks. Many contrastive learning approaches use mined hard negatives to make batches more informative during training but these approaches are inefficient as they increase epoch length proportional to the number of mined negatives and require frequent updates of nearest neighbor indices or mining from recent batches. In this work, we provide an alternative to hard negative mining, Global Contrastive Batch Sampling (GCBS), an efficient approximation to the batch assignment problem that upper bounds the gap between the global and training losses, , in contrastive learning settings. Through experimentation we find GCBS improves state-of-the-art performance in sentence embedding and code-search tasks. Additionally, GCBS is easy to implement as it requires…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Domain Adaptation and Few-Shot Learning · Text and Document Classification Technologies
MethodsContrastive Learning
