CAGMamba: Context-Aware Gated Cross-Modal Mamba Network for Multimodal Sentiment Analysis

Minghai Jiao; Jing Xiao; Peng Xiao; Ende Zhang; Shuang Kan; Wenyan Jiang; Jinyao Li; Yixian Liu; Haidong Xin

arXiv:2604.03650·cs.CL·April 7, 2026

CAGMamba: Context-Aware Gated Cross-Modal Mamba Network for Multimodal Sentiment Analysis

Minghai Jiao, Jing Xiao, Peng Xiao, Ende Zhang, Shuang Kan, Wenyan Jiang, Jinyao Li, Yixian Liu, Haidong Xin

PDF

1 Repo

TL;DR

CAGMamba introduces a novel, efficient, and temporally aware framework for multimodal sentiment analysis that explicitly models sentiment evolution and balances cross-modal fusion using gating mechanisms.

Contribution

It proposes a context-aware gated cross-modal Mamba network with explicit temporal modeling and controllable fusion for dialogue-based sentiment analysis.

Findings

01

Achieves state-of-the-art results on benchmark datasets.

02

Effectively models sentiment evolution across dialogue turns.

03

Balances modality preservation and fusion through learnable gating.

Abstract

Multimodal Sentiment Analysis (MSA) requires effective modeling of cross-modal interactions and contextual dependencies while remaining computationally efficient. Existing fusion approaches predominantly rely on Transformer-based cross-modal attention, which incurs quadratic complexity with respect to sequence length and limits scalability. Moreover, contextual information from preceding utterances is often incorporated through concatenation or independent fusion, without explicit temporal modeling that captures sentiment evolution across dialogue turns. To address these limitations, we propose CAGMamba, a context-aware gated cross-modal Mamba framework for dialogue-based sentiment analysis. Specifically, we organize the contextual and the current-utterance features into a temporally ordered binary sequence, which provides Mamba with explicit temporal structure for modeling sentiment…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

User2024-xj/CAGMamba
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.