Contrastive MIM: A Contrastive Mutual Information Framework for Unified Generative and Discriminative Representation Learning
Micha Livne

TL;DR
This paper introduces cMIM, a contrastive mutual information framework that enhances representation learning by combining generative and discriminative strengths, outperforming existing methods in various tasks.
Contribution
The paper proposes cMIM, a novel contrastive extension of MIM that does not require positive data augmentation and introduces informative embeddings for improved discriminative performance.
Findings
cMIM outperforms MIM and InfoNCE in classification and regression tasks.
cMIM maintains high-quality generative reconstructions.
Informative embeddings significantly boost discriminative capabilities.
Abstract
Learning representations that generalize well to unknown downstream tasks is a central challenge in representation learning. Existing approaches such as contrastive learning, self-supervised masking, and denoising auto-encoders address this challenge with varying trade-offs. In this paper, we introduce the {contrastive Mutual Information Machine} (cMIM), a probabilistic framework that augments the Mutual Information Machine (MIM) with a novel contrastive objective. While MIM maximizes mutual information between inputs and latent variables and encourages clustering of latent codes, its representations underperform on discriminative tasks compared to state-of-the-art alternatives. cMIM addresses this limitation by enforcing global discriminative structure while retaining MIM's generative strengths. We present two main contributions: (1) we propose cMIM, a contrastive extension of MIM that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Domain Adaptation and Few-Shot Learning · Face recognition and analysis
MethodsContrastive Learning · Mutual Information Machine/Mask Image Modeling
