Provable Guarantees for Self-Supervised Deep Learning with Spectral Contrastive Loss
Jeff Z. HaoChen, Colin Wei, Adrien Gaidon, Tengyu Ma

TL;DR
This paper provides the first provable theoretical guarantees for contrastive self-supervised learning without assuming independence of positive pairs, using spectral analysis of augmentation graphs, and demonstrates empirical effectiveness on vision datasets.
Contribution
It introduces a novel spectral decomposition approach on augmentation graphs for contrastive learning, removing the independence assumption and providing provable guarantees.
Findings
Features learned match or outperform strong baselines.
Guarantees hold under linear probe evaluation.
Empirical results validate theoretical analysis.
Abstract
Recent works in self-supervised learning have advanced the state-of-the-art by relying on the contrastive learning paradigm, which learns representations by pushing positive pairs, or similar examples from the same class, closer together while keeping negative pairs far apart. Despite the empirical successes, theoretical foundations are limited -- prior analyses assume conditional independence of the positive pairs given the same class label, but recent empirical applications use heavily correlated positive pairs (i.e., data augmentations of the same image). Our work analyzes contrastive learning without assuming conditional independence of positive pairs using a novel concept of the augmentation graph on data. Edges in this graph connect augmentations of the same data, and ground-truth classes naturally form connected sub-graphs. We propose a loss that performs spectral decomposition…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications · Machine Learning and ELM
MethodsContrastive Learning
