TCAN: Text-oriented Cross Attention Network for Multimodal Sentiment Analysis
Weize Quan, Yunfei Feng, Ming Zhou, Yunzhen Zhao, Tong Wang, and, Dong-Ming Yan

TL;DR
This paper introduces TCAN, a multimodal sentiment analysis model that emphasizes the dominant role of text by using cross-attention mechanisms, improving performance by addressing modality heterogeneity.
Contribution
The paper proposes a novel Text-oriented Cross-Attention Network (TCAN) that prioritizes text modality and incorporates gated control and joint learning to enhance multimodal sentiment analysis.
Findings
TCAN outperforms state-of-the-art methods on CMU-MOSI and CMU-MOSEI datasets.
The model effectively mitigates noise and redundancy through gated control.
Focusing on text improves sentiment prediction accuracy.
Abstract
Multimodal Sentiment Analysis (MSA) endeavors to understand human sentiment by leveraging language, visual, and acoustic modalities. Despite the remarkable performance exhibited by previous MSA approaches, the presence of inherent multimodal heterogeneities poses a challenge, with the contribution of different modalities varying considerably. Past research predominantly focused on improving representation learning techniques and feature fusion strategies. However, many of these efforts overlooked the variation in semantic richness among different modalities, treating each modality uniformly. This approach may lead to underestimating the significance of strong modalities while overemphasizing the importance of weak ones. Motivated by these insights, we introduce a Text-oriented Cross-Attention Network (TCAN), emphasizing the predominant role of the text modality in MSA. Specifically, for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSentiment Analysis and Opinion Mining · Advanced Text Analysis Techniques · Topic Modeling
