Understanding Dimensional Collapse in Contrastive Self-supervised Learning
Li Jing, Pascal Vincent, Yann LeCun, Yuandong Tian

TL;DR
This paper investigates the phenomenon of dimensional collapse in contrastive self-supervised learning, revealing its occurrence and proposing a new method, DirectCLR, that improves representation quality by directly optimizing the embedding space.
Contribution
The paper uncovers the occurrence of dimensional collapse in contrastive learning and introduces DirectCLR, a novel approach that enhances embeddings without a trainable projector.
Findings
Dimensional collapse occurs in contrastive learning.
DirectCLR outperforms SimCLR on ImageNet.
Theoretical insights into collapse dynamics.
Abstract
Self-supervised visual representation learning aims to learn useful representations without relying on human annotations. Joint embedding approach bases on maximizing the agreement between embedding vectors from different views of the same image. Various methods have been proposed to solve the collapsing problem where all embedding vectors collapse to a trivial constant solution. Among these methods, contrastive learning prevents collapse via negative sample pairs. It has been shown that non-contrastive methods suffer from a lesser collapse problem of a different nature: dimensional collapse, whereby the embedding vectors end up spanning a lower-dimensional subspace instead of the entire available embedding space. Here, we show that dimensional collapse also happens in contrastive learning. In this paper, we shed light on the dynamics at play in contrastive learning that leads to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Image Processing Techniques and Applications · Machine Learning and ELM
MethodsBitcoin Customer Service Number +1-833-534-1729 · Contrastive Learning · 1x1 Convolution · Batch Normalization · Residual Connection · Convolution · Average Pooling · Bottleneck Residual Block · Max Pooling · Dense Connections
