MIO : Mutual Information Optimization using Self-Supervised Binary Contrastive Learning
Siladittya Manna, Umapada Pal, Saumik Bhattacharya

TL;DR
This paper introduces a novel mutual information-based loss function for self-supervised contrastive learning, improving performance on multiple benchmark datasets by optimizing mutual information between pairs.
Contribution
It proposes a new loss function modeling contrastive learning as a binary classification task, with a closed-form gradient expression and analytical convergence analysis.
Findings
Outperforms SOTA on CIFAR-10, CIFAR-100, STL-10, Tiny-ImageNet
Achieves top-1 linear evaluation accuracy of 78.4% on ImageNet100
Surpasses existing contrastive methods in accuracy metrics
Abstract
Self-supervised contrastive learning frameworks have progressed rapidly over the last few years. In this paper, we propose a novel loss function for contrastive learning. We model our pre-training task as a binary classification problem to induce an implicit contrastive effect. We further improve the n\"aive loss function after removing the effect of the positive-positive repulsion and incorporating the upper bound of the negative pair repulsion. Unlike existing methods, the proposed loss function optimizes the mutual information in positive and negative pairs. We also present a closed-form expression for the parameter gradient flow and compare the behaviour of self-supervised contrastive frameworks using Hessian eigenspectrum to analytically study their convergence. The proposed method outperforms SOTA self-supervised contrastive frameworks on benchmark datasets such as CIFAR-10,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications · Human Pose and Action Recognition
MethodsContrastive Learning
