TL;DR
This paper introduces a novel multi-view representation disentangling method that enhances interpretability and generalization by going beyond traditional inductive biases, using a two-stage framework to extract consistent and specific features.
Contribution
The proposed method uniquely discovers multi-view consistency to guide disentangling without relying on strong inductive biases, improving interpretability and performance.
Findings
Outperforms 12 comparison methods in clustering and classification.
Produces compact and interpretable representations.
Demonstrates effectiveness on four multi-view datasets.
Abstract
Multi-view (or -modality) representation learning aims to understand the relationships between different view representations. Existing methods disentangle multi-view representations into consistent and view-specific representations by introducing strong inductive biases, which can limit their generalization ability. In this paper, we propose a novel multi-view representation disentangling method that aims to go beyond inductive biases, ensuring both interpretability and generalizability of the resulting representations. Our method is based on the observation that discovering multi-view consistency in advance can determine the disentangling information boundary, leading to a decoupled learning objective. We also found that the consistency can be easily extracted by maximizing the transformation invariance and clustering consistency between views. These observations drive us to propose a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
