Provable Guarantees for Self-Supervised Deep Learning with Spectral   Contrastive Loss

Jeff Z. HaoChen; Colin Wei; Adrien Gaidon; Tengyu Ma

arXiv:2106.04156·cs.LG·June 27, 2022·36 cites

Provable Guarantees for Self-Supervised Deep Learning with Spectral Contrastive Loss

Jeff Z. HaoChen, Colin Wei, Adrien Gaidon, Tengyu Ma

PDF

Open Access 2 Repos 1 Video

TL;DR

This paper provides the first provable theoretical guarantees for contrastive self-supervised learning without assuming independence of positive pairs, using spectral analysis of augmentation graphs, and demonstrates empirical effectiveness on vision datasets.

Contribution

It introduces a novel spectral decomposition approach on augmentation graphs for contrastive learning, removing the independence assumption and providing provable guarantees.

Findings

01

Features learned match or outperform strong baselines.

02

Guarantees hold under linear probe evaluation.

03

Empirical results validate theoretical analysis.

Abstract

Recent works in self-supervised learning have advanced the state-of-the-art by relying on the contrastive learning paradigm, which learns representations by pushing positive pairs, or similar examples from the same class, closer together while keeping negative pairs far apart. Despite the empirical successes, theoretical foundations are limited -- prior analyses assume conditional independence of the positive pairs given the same class label, but recent empirical applications use heavily correlated positive pairs (i.e., data augmentations of the same image). Our work analyzes contrastive learning without assuming conditional independence of positive pairs using a novel concept of the augmentation graph on data. Edges in this graph connect augmentations of the same data, and ground-truth classes naturally form connected sub-graphs. We propose a loss that performs spectral decomposition…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

Provable Guarantees for Self-Supervised Deep Learning with Spectral Contrastive Loss· slideslive

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications · Machine Learning and ELM

MethodsContrastive Learning