On Deep Multi-View Representation Learning: Objectives and Optimization

Weiran Wang; Raman Arora; Karen Livescu; Jeff Bilmes

arXiv:1602.01024·cs.LG·February 3, 2016·48 cites

On Deep Multi-View Representation Learning: Objectives and Optimization

Weiran Wang, Raman Arora, Karen Livescu, Jeff Bilmes

PDF

Open Access 1 Repo

TL;DR

This paper analyzes various deep multi-view representation learning techniques, highlighting the effectiveness of correlation-based methods and introducing the novel DCCAE approach, with empirical validation across multiple data modalities.

Contribution

The paper provides a comprehensive comparison of existing multi-view learning methods and introduces DCCAE, a new variant that improves performance on diverse tasks.

Findings

01

Correlation-based methods outperform autoencoder-like techniques.

02

DCCAE achieves the best results on most tasks.

03

Stochastic optimization enhances minibatch correlation training.

Abstract

We consider learning representations (features) in the setting in which we have access to multiple unlabeled views of the data for learning while only one view is available for downstream tasks. Previous work on this problem has proposed several techniques based on deep neural networks, typically involving either autoencoder-like networks with a reconstruction objective or paired feedforward networks with a batch-style correlation-based objective. We analyze several techniques based on prior work, as well as new variants, and compare them empirically on image, speech, and text tasks. We find an advantage for correlation-based representation learning, while the best results on most tasks are obtained with our new variant, deep canonically correlated autoencoders (DCCAE). We also explore a stochastic optimization procedure for minibatch correlation-based objectives and discuss the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

jiayueg/multimodal
pytorch

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Image and Video Retrieval Techniques · Image Retrieval and Classification Techniques · Video Analysis and Summarization