Towards the Generalization of Contrastive Self-Supervised Learning

Weiran Huang; Mingyang Yi; Xuyang Zhao; Zihao Jiang

arXiv:2111.00743·cs.LG·March 3, 2023·21 cites

Towards the Generalization of Contrastive Self-Supervised Learning

Weiran Huang, Mingyang Yi, Xuyang Zhao, Zihao Jiang

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper provides a theoretical framework for understanding the generalization of contrastive self-supervised learning, linking it to data augmentation, representation alignment, and class divergence, supported by empirical validation.

Contribution

It introduces a $(\sigma,\delta)$-measure to quantify data augmentation and derives an upper bound on classification error, connecting theoretical insights with practical contrastive loss functions.

Findings

01

Generalization depends on data augmentation, representation alignment, and class divergence.

02

InfoNCE and cross-correlation losses achieve key representation properties.

03

Stronger data concentration correlates with better downstream performance.

Abstract

Recently, self-supervised learning has attracted great attention, since it only requires unlabeled data for model training. Contrastive learning is one popular method for self-supervised learning and has achieved promising empirical performance. However, the theoretical understanding of its generalization ability is still limited. To this end, we define a kind of $(σ, δ)$ -measure to mathematically quantify the data augmentation, and then provide an upper bound of the downstream classification error rate based on the measure. It reveals that the generalization ability of contrastive self-supervised learning is related to three key factors: alignment of positive samples, divergence of class centers, and concentration of augmented data. The first two factors are properties of learned representations, while the third one is determined by pre-defined data augmentation. We further…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

huang-research-group/contrastive2021
pytorchOfficial

Videos

Towards the Generalization of Contrastive Self-Supervised Learning· slideslive

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Cancer-related molecular mechanisms research · Human Pose and Action Recognition

MethodsBitcoin Customer Service Number +1-833-534-1729 · Contrastive Learning · InfoNCE · 1x1 Convolution · Residual Block · Max Pooling · Kaiming Initialization · Average Pooling · Global Average Pooling · Batch Normalization