TL;DR
This paper introduces DisCoder, an unsupervised discriminative model that learns useful features by jointly modeling data and latent variables, achieving state-of-the-art results without relying on labeled data.
Contribution
It proposes a novel EM-like training method with adversarial regularization for unsupervised feature learning using discriminative models.
Findings
Achieves state-of-the-art performance on several tasks.
Effectively learns both one-hot and real-valued latent features.
Demonstrates flexibility and robustness in unsupervised learning.
Abstract
In recent years, deep discriminative models have achieved extraordinary performance on supervised learning tasks, significantly outperforming their generative counterparts. However, their success relies on the presence of a large amount of labeled data. How can one use the same discriminative models for learning useful features in the absence of labels? We address this question in this paper, by jointly modeling the distribution of data and latent features in a manner that explicitly assigns zero probability to unobserved data. Rather than maximizing the marginal probability of observed data, we maximize the joint probability of the data and the latent features using a two step EM-like procedure. To prevent the model from overfitting to our initial selection of latent features, we use adversarial regularization. Depending on the task, we allow the latent features to be one-hot or…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
