Rethinking Minimal Sufficient Representation in Contrastive Learning
Haoqing Wang, Xun Guo, Zhi-Hong Deng, Yan Lu

TL;DR
This paper reveals that contrastive learning's minimal sufficient representations may omit task-relevant information, leading to performance issues, and proposes a regularization method to incorporate more task-relevant info, improving downstream task performance.
Contribution
It theoretically analyzes the limitations of contrastive learning's minimal representations and introduces a mutual information regularization to enhance task-relevant information inclusion.
Findings
The minimal sufficient representation in contrastive learning is not always sufficient for downstream tasks.
Increasing mutual information between representation and input improves downstream performance.
The proposed method significantly enhances classic contrastive learning models.
Abstract
Contrastive learning between different views of the data achieves outstanding success in the field of self-supervised representation learning and the learned representations are useful in broad downstream tasks. Since all supervision information for one view comes from the other view, contrastive learning approximately obtains the minimal sufficient representation which contains the shared information and eliminates the non-shared information between views. Considering the diversity of the downstream tasks, it cannot be guaranteed that all task-relevant information is shared between views. Therefore, we assume the non-shared task-relevant information cannot be ignored and theoretically prove that the minimal sufficient representation in contrastive learning is not sufficient for the downstream tasks, which causes performance degradation. This reveals a new problem that the contrastive…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications · Geophysical Methods and Applications
MethodsContrastive Learning
