Angle-Optimized Partial Disentanglement for Multimodal Emotion Recognition in Conversation
Xinyi Che, Wenbo Wang, Yuanbo Hou, Mingjie Xie, Qijun Zhao, Jian Guan

TL;DR
This paper introduces AO-FL, a novel framework for multimodal emotion recognition in conversation that employs angle-optimized partial disentanglement to better capture subtle emotional cues and improve recognition accuracy.
Contribution
The paper proposes an adaptive angular optimization method for partial feature disentanglement in MERC, enhancing the integration of shared and modality-specific features.
Findings
AO-FL outperforms state-of-the-art methods in MERC tasks.
The framework effectively preserves modality-specific cues like micro-expressions and sarcasm.
AO-FL generalizes well to other multimodal tasks such as MER.
Abstract
Multimodal Emotion Recognition in Conversation (MERC) aims to enhance emotion understanding by integrating complementary cues from text, audio, and visual modalities. Existing MERC approaches predominantly focus on cross-modal shared features, often overlooking modality-specific features that capture subtle yet critical emotional cues such as micro-expressions, prosodic variations, and sarcasm. Although related work in multimodal emotion recognition (MER) has explored disentangling shared and modality-specific features, these methods typically employ rigid orthogonal constraints to achieve full disentanglement, which neglects the inherent complementarity between feature types and may limit recognition performance. To address these challenges, we propose Angle-Optimized Feature Learning (AO-FL), a framework tailored for MERC that achieves partial disentanglement of shared and specific…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEmotion and Mood Recognition · Sentiment Analysis and Opinion Mining · Face and Expression Recognition
