More Synergy, Less Redundancy: Exploiting Joint Mutual Information for Self-Supervised Learning
Salman Mohamadi, Gianfranco Doretto, Donald A. Adjeroh

TL;DR
This paper introduces a novel perspective on mutual information in self-supervised learning, using partial information decomposition to enhance representation learning by reducing redundancy and increasing synergy.
Contribution
It reformulates SSL in terms of joint mutual information and PID, proposing a new training protocol that improves model performance by balancing information components.
Findings
Recalibrated existing redundancy reduction baselines.
Proposed a new SSL training protocol.
Demonstrated effectiveness on multiple datasets and tasks.
Abstract
Self-supervised learning (SSL) is now a serious competitor for supervised learning, even though it does not require data annotation. Several baselines have attempted to make SSL models exploit information about data distribution, and less dependent on the augmentation effect. However, there is no clear consensus on whether maximizing or minimizing the mutual information between representations of augmentation views practically contribute to improvement or degradation in performance of SSL models. This paper is a fundamental work where, we investigate role of mutual information in SSL, and reformulate the problem of SSL in the context of a new perspective on mutual information. To this end, we consider joint mutual information from the perspective of partial information decomposition (PID) as a key step in \textbf{reliable multivariate information measurement}. PID enables us to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Machine Learning and Data Classification · Text and Document Classification Technologies
