Memory Fusion Network for Multi-view Sequential Learning
Amir Zadeh, Paul Pu Liang, Navonil Mazumder, Soujanya Poria, Erik, Cambria, Louis-Philippe Morency

TL;DR
The paper introduces the Memory Fusion Network (MFN), a neural architecture that models view-specific and cross-view interactions in multi-view sequential learning, achieving state-of-the-art results on benchmark datasets.
Contribution
The novel MFN architecture explicitly models both view-specific and cross-view interactions through a specialized attention mechanism and memory component.
Findings
MFN outperforms existing multi-view learning approaches.
MFN achieves new state-of-the-art results on benchmark datasets.
Extensive experiments validate the effectiveness of the proposed architecture.
Abstract
Multi-view sequential learning is a fundamental problem in machine learning dealing with multi-view sequences. In a multi-view sequence, there exists two forms of interactions between different views: view-specific interactions and cross-view interactions. In this paper, we present a new neural architecture for multi-view sequential learning called the Memory Fusion Network (MFN) that explicitly accounts for both interactions in a neural architecture and continuously models them through time. The first component of the MFN is called the System of LSTMs, where view-specific interactions are learned in isolation through assigning an LSTM function to each view. The cross-view interactions are then identified using a special attention mechanism called the Delta-memory Attention Network (DMAN) and summarized through time with a Multi-view Gated Memory. Through extensive experimentation, MFN…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Machine Learning and ELM · Face and Expression Recognition
MethodsSigmoid Activation · Tanh Activation · Long Short-Term Memory
