Found in Translation: Learning Robust Joint Representations by Cyclic Translations Between Modalities
Hai Pham, Paul Pu Liang, Thomas Manzini, Louis-Philippe Morency,, Barnabas Poczos

TL;DR
This paper introduces a novel translation-based approach to learn robust joint representations for multimodal sentiment analysis, allowing effective predictions even with missing or noisy modalities, and achieves state-of-the-art results.
Contribution
It proposes a translation and cycle consistency framework that learns joint representations from a single modality, enhancing robustness and performance in multimodal sentiment analysis.
Findings
Achieves state-of-the-art results on multiple datasets
Models remain robust with missing or noisy modalities
Learns discriminative joint representations with fewer modalities
Abstract
Multimodal sentiment analysis is a core research area that studies speaker sentiment expressed from the language, visual, and acoustic modalities. The central challenge in multimodal learning involves inferring joint representations that can process and relate information from these modalities. However, existing work learns joint representations by requiring all modalities as input and as a result, the learned representations may be sensitive to noisy or missing modalities at test time. With the recent success of sequence to sequence (Seq2Seq) models in machine translation, there is an opportunity to explore new ways of learning joint representations that may not require all input modalities at test time. In this paper, we propose a method to learn robust joint representations by translating between modalities. Our method is based on the key insight that translation from a source to a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSentiment Analysis and Opinion Mining · Topic Modeling · Multimodal Machine Learning Applications
MethodsCycle Consistency Loss
