Self-Supervised Learning with Kernel Dependence Maximization
Yazhe Li, Roman Pogodin, Danica J. Sutherland, Arthur Gretton

TL;DR
This paper introduces SSL-HSIC, a self-supervised learning framework that maximizes statistical dependence between transformed image representations, providing a new understanding of mutual information bounds and achieving state-of-the-art results.
Contribution
It proposes SSL-HSIC, a novel dependence-maximization approach for self-supervised learning, offering theoretical insights and practical improvements over existing methods like InfoNCE and BYOL.
Findings
SSL-HSIC matches state-of-the-art performance on ImageNet
It enables direct dependence optimization with linear time complexity
Effective across various vision tasks including segmentation and depth estimation
Abstract
We approach self-supervised learning of image representations from a statistical dependence perspective, proposing Self-Supervised Learning with the Hilbert-Schmidt Independence Criterion (SSL-HSIC). SSL-HSIC maximizes dependence between representations of transformations of an image and the image identity, while minimizing the kernelized variance of those representations. This framework yields a new understanding of InfoNCE, a variational lower bound on the mutual information (MI) between different transformations. While the MI itself is known to have pathologies which can result in learning meaningless representations, its bound is much better behaved: we show that it implicitly approximates SSL-HSIC (with a slightly different regularizer). Our approach also gives us insight into BYOL, a negative-free SSL method, since SSL-HSIC similarly learns local neighborhoods of samples. SSL-HSIC…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Machine Learning and Data Classification · Anomaly Detection Techniques and Applications
MethodsBootstrap Your Own Latent · InfoNCE
