Contrastive Learning with Synthetic Positives
Dewen Zeng, Yawen Wu, Xinrong Hu, Xiaowei Xu, Yiyu Shi

TL;DR
This paper introduces CLSP, a contrastive learning method that uses synthetic images generated by diffusion models as hard positives, improving SSL performance and establishing a new baseline for synthetic data use.
Contribution
It proposes a novel approach to generate synthetic positive samples via diffusion models, enhancing contrastive learning with diverse, semantically similar images.
Findings
Over 2% improvement in linear evaluation accuracy.
Outperforms previous methods on multiple benchmarks.
Excels in transfer learning across most datasets.
Abstract
Contrastive learning with the nearest neighbor has proved to be one of the most efficient self-supervised learning (SSL) techniques by utilizing the similarity of multiple instances within the same class. However, its efficacy is constrained as the nearest neighbor algorithm primarily identifies "easy" positive pairs, where the representations are already closely located in the embedding space. In this paper, we introduce a novel approach called Contrastive Learning with Synthetic Positives (CLSP) that utilizes synthetic images, generated by an unconditional diffusion model, as the additional positives to help the model learn from diverse positives. Through feature interpolation in the diffusion model sampling process, we generate images with distinct backgrounds yet similar semantic content to the anchor image. These images are considered "hard" positives for the anchor image, and when…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsInnovative Teaching and Learning Methods · Online and Blended Learning
MethodsNearest-Neighbor Contrastive Learning of Visual Representations · Diffusion · Contrastive Learning
