Enhancing Emotion Recognition in Incomplete Data: A Novel Cross-Modal Alignment, Reconstruction, and Refinement Framework
Haoqin Sun, Shiwan Zhao, Shaokai Li, Xiangyu Kong, Xuechen Wang, Aobo, Kong, Jiaming Zhou, Yong Chen, Wenjia Zeng, Yong Qin

TL;DR
This paper introduces the CM-ARR framework, which improves multimodal emotion recognition when data from some modalities is missing by aligning, reconstructing, and refining representations using contrastive learning and normalizing flows.
Contribution
The novel CM-ARR framework combines cross-modal alignment, reconstruction with normalizing flows, and refinement to handle incomplete multimodal data for emotion recognition.
Findings
CM-ARR outperforms existing methods on IEMOCAP and MSP-IMPROV datasets.
Achieves over 2% improvements in WAR and UAR on average across missing modality scenarios.
Effective in both missing and complete modality conditions.
Abstract
Multimodal emotion recognition systems rely heavily on the full availability of modalities, suffering significant performance declines when modal data is incomplete. To tackle this issue, we present the Cross-Modal Alignment, Reconstruction, and Refinement (CM-ARR) framework, an innovative approach that sequentially engages in cross-modal alignment, reconstruction, and refinement phases to handle missing modalities and enhance emotion recognition. This framework utilizes unsupervised distribution-based contrastive learning to align heterogeneous modal distributions, reducing discrepancies and modeling semantic uncertainty effectively. The reconstruction phase applies normalizing flow models to transform these aligned distributions and recover missing modalities. The refinement phase employs supervised point-based contrastive learning to disrupt semantic correlations and accentuate…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAnomaly Detection Techniques and Applications · Emotion and Mood Recognition
MethodsALIGN · Contrastive Learning
