Graph-based Interaction Augmentation Network for Robust Multimodal Sentiment Analysis
Hu Zhangfeng, Shi mengxin

TL;DR
This paper introduces a graph-based framework for multimodal sentiment analysis that models complex intra- and inter-modality interactions, improving robustness against modality imperfections by leveraging complementary information.
Contribution
It proposes a novel hypergraph and directed graph approach to exploit intra- and inter-modality dependencies, enhancing the extraction of missing semantics in imperfect multimodal data.
Findings
Effective in handling modality imperfections
Outperforms existing methods on MOSI and MOSEI datasets
Enhances robustness of multimodal sentiment analysis
Abstract
The inevitable modality imperfection in real-world scenarios poses significant challenges for Multimodal Sentiment Analysis (MSA). While existing methods tailor reconstruction or joint representation learning strategies to restore missing semantics, they often overlook complex dependencies within and across modalities. Consequently, they fail to fully leverage available modalities to capture complementary semantics. To this end, this paper proposes a novel graph-based framework to exploit both intra- and inter-modality interactions, enabling imperfect samples to derive missing semantics from complementary parts for robust MSA. Specifically, we first devise a learnable hypergraph to model intra-modality temporal dependencies to exploit contextual information within each modality. Then, a directed graph is employed to explore inter-modality correlations based on attention mechanism,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEmotion and Mood Recognition · Sentiment Analysis and Opinion Mining · Multimodal Machine Learning Applications
