On the Surrogate Gap between Contrastive and Supervised Losses
Han Bao, Yoshihiro Nagano, Kento Nozawa

TL;DR
This paper establishes new theoretical bounds linking contrastive and supervised losses, showing that increasing negative samples reduces the surrogate gap and improves downstream classification, supported by experiments across multiple domains.
Contribution
It provides the first surrogate bounds valid for all negative sample sizes, clarifying the theoretical relationship between contrastive and supervised learning.
Findings
Larger negative sample sizes decrease the surrogate gap.
Contrastive loss acts as a surrogate for downstream classification loss.
Theoretical bounds align with empirical results across datasets.
Abstract
Contrastive representation learning encourages data representation to make semantically similar pairs closer than randomly drawn negative samples, which has been successful in various domains such as vision, language, and graphs. Recent theoretical studies have attempted to explain the benefit of the large negative sample size by upper-bounding the downstream classification loss with the contrastive loss. However, the previous surrogate bounds have two drawbacks: they are only legitimate for a limited range of negative sample sizes and prohibitively large even within that range. Due to these drawbacks, there still does not exist a consensus on how negative sample size theoretically correlates with downstream classification performance. Following the simplified setting where positive pairs are drawn from the true distribution (not generated by data augmentation; as supposed in previous…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Domain Adaptation and Few-Shot Learning · Natural Language Processing Techniques
