On the Importance of Embedding Norms in Self-Supervised Learning

Andrew Draganov; Sharvaree Vadgama; Sebastian Damrich; Jan Niklas B\"ohm; Lucas Maes; Dmitry Kobak; Erik Bekkers

arXiv:2502.09252·cs.LG·June 12, 2025

On the Importance of Embedding Norms in Self-Supervised Learning

Andrew Draganov, Sharvaree Vadgama, Sebastian Damrich, Jan Niklas B\"ohm, Lucas Maes, Dmitry Kobak, Erik Bekkers

PDF

Open Access 1 Repo

TL;DR

This paper investigates the role of embedding norms in self-supervised learning, revealing their influence on convergence rates and confidence, and demonstrating how manipulating norms affects training dynamics.

Contribution

The study provides a systematic analysis of embedding norms in SSL, establishing their importance in convergence and confidence, supported by theory, simulations, and experiments.

Findings

01

Embedding norms govern SSL convergence rates

02

Embedding norms encode network confidence

03

Manipulating norms impacts convergence speed

Abstract

Self-supervised learning (SSL) allows training data representations without a supervised signal and has become an important paradigm in machine learning. Most SSL methods employ the cosine similarity between embedding vectors and hence effectively embed data on a hypersphere. While this seemingly implies that embedding norms cannot play any role in SSL, a few recent works have suggested that embedding norms have properties related to network convergence and confidence. In this paper, we resolve this apparent contradiction and systematically establish the embedding norm's role in SSL training. Using theoretical analysis, simulations, and experiments, we show that embedding norms (i) govern SSL convergence rates and (ii) encode network confidence, with smaller norms corresponding to unexpected samples. Additionally, we show that manipulating embedding norms can have large effects on…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

andrew-draganov/sslembeddingnorms
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsInnovative Teaching and Learning Methods · Educational Technology and Assessment