Towards a Unified Theoretical Understanding of Non-contrastive Learning via Rank Differential Mechanism
Zhijian Zhuo, Yifei Wang, Jinwen Ma, Yisen Wang

TL;DR
This paper introduces the Rank Differential Mechanism (RDM), a unified theoretical framework explaining how asymmetric designs in non-contrastive learning prevent feature collapse by creating a rank difference in feature representations.
Contribution
The paper proposes RDM, a unified theory applicable to various non-contrastive learning methods, explaining their effectiveness and guiding the design of new variants.
Findings
RDM explains how asymmetric designs prevent feature collapse.
New variants based on RDM achieve comparable or better performance.
Theoretical insights guide the development of effective non-contrastive methods.
Abstract
Recently, a variety of methods under the name of non-contrastive learning (like BYOL, SimSiam, SwAV, DINO) show that when equipped with some asymmetric architectural designs, aligning positive pairs alone is sufficient to attain good performance in self-supervised visual learning. Despite some understandings of some specific modules (like the predictor in BYOL), there is yet no unified theoretical understanding of how these seemingly different asymmetric designs can all avoid feature collapse, particularly considering methods that also work without the predictor (like DINO). In this work, we propose a unified theoretical understanding for existing variants of non-contrastive learning. Our theory named Rank Differential Mechanism (RDM) shows that all these asymmetric designs create a consistent rank difference in their dual-branch output features. This rank difference will provably lead…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsCancer-related molecular mechanisms research · AI and Multimedia in Education · Domain Adaptation and Few-Shot Learning
MethodsLARS · Swapping Assignments between Views · Bootstrap Your Own Latent
