The Triad of Failure Modes and a Possible Way Out
Emanuele Sansone

TL;DR
This paper introduces a new objective function for cluster-based self-supervised learning that effectively addresses common failure modes, allowing simpler training without complex components, and is supported by theoretical and experimental validation.
Contribution
The paper proposes a novel, theoretically grounded objective for SSL that avoids complex training tricks and tackles key failure modes in clustering-based methods.
Findings
Effective in preventing representation and cluster collapse
Addresses invariance to data augmentation and cluster permutation
Works well on toy and real-world datasets
Abstract
We present a novel objective function for cluster-based self-supervised learning (SSL) that is designed to circumvent the triad of failure modes, namely representation collapse, cluster collapse, and the problem of invariance to permutations of cluster assignments. This objective consists of three key components: (i) A generative term that penalizes representation collapse, (ii) a term that promotes invariance to data augmentations, thereby addressing the issue of label permutations and (ii) a uniformity term that penalizes cluster collapse. Additionally, our proposed objective possesses two notable advantages. Firstly, it can be interpreted from a Bayesian perspective as a lower bound on the data log-likelihood. Secondly, it enables the training of a standard backbone architecture without the need for asymmetric elements like stop gradients, momentum encoders, or specialized clustering…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification · Text and Document Classification Technologies · Face and Expression Recognition
