Infinite Width Limits of Self Supervised Neural Networks

Maximilian Fleissner; Gautham Govind Anil; Debarghya Ghoshdastidar

arXiv:2411.11176·cs.LG·May 6, 2025

Infinite Width Limits of Self Supervised Neural Networks

Maximilian Fleissner, Gautham Govind Anil, Debarghya Ghoshdastidar

PDF

Open Access

TL;DR

This paper proves that the NTK of two-layer neural networks trained with Barlow Twins loss converges to a constant as width increases, providing a theoretical foundation for using kernel methods to analyze self-supervised learning.

Contribution

It establishes the first rigorous connection between NTK and self-supervised learning with Barlow Twins, showing NTK convergence in the infinite width limit.

Findings

01

NTK of Barlow Twins becomes constant at infinite width

02

Provides generalization bounds for kernelized Barlow Twins

03

Connects kernel theory with finite-width neural networks

Abstract

The NTK is a widely used tool in the theoretical analysis of deep learning, allowing us to look at supervised deep neural networks through the lenses of kernel regression. Recently, several works have investigated kernel models for self-supervised learning, hypothesizing that these also shed light on the behavior of wide neural networks by virtue of the NTK. However, it remains an open question to what extent this connection is mathematically sound -- it is a commonly encountered misbelief that the kernel behavior of wide neural networks emerges irrespective of the loss function it is trained on. In this paper, we bridge the gap between the NTK and self-supervised learning, focusing on two-layer neural networks trained under the Barlow Twins loss. We prove that the NTK of Barlow Twins indeed becomes constant as the width of the network approaches infinity. Our analysis technique is a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications

MethodsBarlow Twins · Neural Tangent Kernel