On the Importance of Embedding Norms in Self-Supervised Learning
Andrew Draganov, Sharvaree Vadgama, Sebastian Damrich, Jan Niklas B\"ohm, Lucas Maes, Dmitry Kobak, Erik Bekkers

TL;DR
This paper investigates the role of embedding norms in self-supervised learning, revealing their influence on convergence rates and confidence, and demonstrating how manipulating norms affects training dynamics.
Contribution
The study provides a systematic analysis of embedding norms in SSL, establishing their importance in convergence and confidence, supported by theory, simulations, and experiments.
Findings
Embedding norms govern SSL convergence rates
Embedding norms encode network confidence
Manipulating norms impacts convergence speed
Abstract
Self-supervised learning (SSL) allows training data representations without a supervised signal and has become an important paradigm in machine learning. Most SSL methods employ the cosine similarity between embedding vectors and hence effectively embed data on a hypersphere. While this seemingly implies that embedding norms cannot play any role in SSL, a few recent works have suggested that embedding norms have properties related to network convergence and confidence. In this paper, we resolve this apparent contradiction and systematically establish the embedding norm's role in SSL training. Using theoretical analysis, simulations, and experiments, we show that embedding norms (i) govern SSL convergence rates and (ii) encode network confidence, with smaller norms corresponding to unexpected samples. Additionally, we show that manipulating embedding norms can have large effects on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsInnovative Teaching and Learning Methods · Educational Technology and Assessment
