Revisiting Multimodal Emotion Recognition in Conversation from the Perspective of Graph Spectrum
Tao Meng, Fuchen Zhang, Yuntao Shou, Wei Ai, Nan Yin, Keqin Li

TL;DR
This paper introduces GS-MCC, a graph-spectrum-based framework that enhances multimodal emotion recognition in conversations by effectively capturing long-distance and frequency-specific semantic features, outperforming existing GNN methods.
Contribution
The paper proposes a novel graph-spectrum approach using Fourier graph operators and contrastive learning to better model semantic consistency and complementarity in multimodal dialogue data.
Findings
GS-MCC outperforms existing methods on benchmark datasets.
Effective extraction of high and low-frequency information improves emotion recognition.
Contrastive learning enhances the collaboration of semantic features.
Abstract
Efficiently capturing consistent and complementary semantic features in a multimodal conversation context is crucial for Multimodal Emotion Recognition in Conversation (MERC). Existing methods mainly use graph structures to model dialogue context semantic dependencies and employ Graph Neural Networks (GNN) to capture multimodal semantic features for emotion recognition. However, these methods are limited by some inherent characteristics of GNN, such as over-smoothing and low-pass filtering, resulting in the inability to learn long-distance consistency information and complementary information efficiently. Since consistency and complementarity information correspond to low-frequency and high-frequency information, respectively, this paper revisits the problem of multimodal emotion recognition in conversation from the perspective of the graph spectrum. Specifically, we propose a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSentiment Analysis and Opinion Mining · Educational Technology and Pedagogy · Emotion and Mood Recognition
MethodsSoftmax · Contrastive Learning
