GEDI: GEnerative and DIscriminative Training for Self-Supervised Learning
Emanuele Sansone, Robin Manhaeve

TL;DR
GEDI is a unified framework that combines generative and discriminative self-supervised learning methods, improving clustering and classification performance, especially in small data scenarios, through joint training and logical constraints.
Contribution
This paper introduces GEDI, a novel unified framework that integrates generative and discriminative training for self-supervised learning, supported by Bayesian analysis and experimental validation.
Findings
GEDI outperforms existing methods in clustering accuracy.
Integration with logical constraints enhances performance in small data regimes.
Joint training of generative and discriminative models is effective.
Abstract
Self-supervised learning is a popular and powerful method for utilizing large amounts of unlabeled data, for which a wide variety of training objectives have been proposed in the literature. In this study, we perform a Bayesian analysis of state-of-the-art self-supervised learning objectives and propose a unified formulation based on likelihood learning. Our analysis suggests a simple method for integrating self-supervised learning with generative models, allowing for the joint training of these two seemingly distinct approaches. We refer to this combined framework as GEDI, which stands for GEnerative and DIscriminative training. Additionally, we demonstrate an instantiation of the GEDI framework by integrating an energy-based model with a cluster-based self-supervised learning model. Through experiments on synthetic and real-world data, including SVHN, CIFAR10, and CIFAR100, we show…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech Recognition and Synthesis · Domain Adaptation and Few-Shot Learning · Text and Document Classification Technologies
