Bridge Correlational Neural Networks for Multilingual Multimodal   Representation Learning

Janarthanan Rajendran; Mitesh M. Khapra; Sarath Chandar; Balaraman; Ravindran

arXiv:1510.03519·cs.CL·July 4, 2016

Bridge Correlational Neural Networks for Multilingual Multimodal Representation Learning

Janarthanan Rajendran, Mitesh M. Khapra, Sarath Chandar, Balaraman, Ravindran

PDF

1 Repo

TL;DR

This paper introduces a novel neural network model that learns shared representations across multiple views using only pivot-based parallel data, enabling cross-lingual and multimodal tasks without direct pairings.

Contribution

It proposes a generic bridge correlational neural network model that effectively learns common representations across multiple views with only pivot-view data, applicable to n views.

Findings

01

Achieved state-of-the-art in multilingual document classification.

02

Demonstrated promising results in multilingual multimodal retrieval.

03

Validated the model on a new dataset created for this purpose.

Abstract

Recently there has been a lot of interest in learning common representations for multiple views of data. Typically, such common representations are learned using a parallel corpus between the two views (say, 1M images and their English captions). In this work, we address a real-world scenario where no direct parallel data is available between two views of interest (say, $V_{1}$ and $V_{2}$ ) but parallel data is available between each of these views and a pivot view ( $V_{3}$ ). We propose a model for learning a common representation for $V_{1}$ , $V_{2}$ and $V_{3}$ using only the parallel data available between $V_{1} V_{3}$ and $V_{2} V_{3}$ . The proposed model is generic and even works when there are $n$ views of interest and only one pivot view which acts as a bridge between them. There are two specific downstream applications that we focus on (i) transfer learning between languages $L_{1}$ , $L_{2}$ ,..., $L_{n}$ …

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

adobe-research/Cross-lingual-Test-Dataset-XTD10
none

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.