TL;DR
This paper introduces CorrMCNN, a novel deep autoencoder-based model that enhances multi-view data representation by increasing interaction at each step, outperforming existing methods on benchmark datasets.
Contribution
The paper proposes a new step-based correlation multi-modal CNN that improves common representation learning by integrating interactions at each hidden layer.
Findings
Achieves better performance than state-of-the-art on MNIST and XRMB datasets.
Effective in joint common representation learning and transfer learning.
Demonstrates the advantage of step-based correlation in multi-modal CNNs.
Abstract
Deep learning techniques have been successfully used in learning a common representation for multi-view data, wherein the different modalities are projected onto a common subspace. In a broader perspective, the techniques used to investigate common representation learning falls under the categories of canonical correlation-based approaches and autoencoder based approaches. In this paper, we investigate the performance of deep autoencoder based methods on multi-view data. We propose a novel step-based correlation multi-modal CNN (CorrMCNN) which reconstructs one view of the data given the other while increasing the interaction between the representations at each hidden layer or every intermediate step. Finally, we evaluate the performance of the proposed model on two benchmark datasets - MNIST and XRMB. Through extensive experiments, we find that the proposed model achieves better…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsSolana Customer Service Number +1-833-534-1729
