TL;DR
This paper introduces CoCoNet, a self-supervised learning framework that enhances multi-view representation learning by enforcing global consistency and local complementarity, leading to improved discriminability of learned features.
Contribution
The paper proposes a novel multi-view learning method, CoCoNet, which combines global distribution alignment with local discriminative complementarity regularization.
Findings
Outperforms state-of-the-art self-supervised methods.
Effectively captures implicit shared knowledge among views.
Enhances discriminability of learned representations.
Abstract
While self-supervised learning techniques are often used to mining implicit knowledge from unlabeled data via modeling multiple views, it is unclear how to perform effective representation learning in a complex and inconsistent context. To this end, we propose a methodology, specifically consistency and complementarity network (CoCoNet), which avails of strict global inter-view consistency and local cross-view complementarity preserving regularization to comprehensively learn representations from multiple views. On the global stage, we reckon that the crucial knowledge is implicitly shared among views, and enhancing the encoder to capture such knowledge from data can improve the discriminability of the learned representations. Hence, preserving the global consistency of multiple views ensures the acquisition of common knowledge. CoCoNet aligns the probabilistic distribution of views by…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
