Deep Imbalanced Learning for Multimodal Emotion Recognition in Conversations
Tao Meng, Yuntao Shou, Wei Ai, Nan Yin, Keqin Li

TL;DR
This paper introduces CBERL, a novel multimodal emotion recognition model that effectively handles imbalanced data, improving accuracy especially on minority emotion classes through data augmentation, feature fusion, and boundary learning techniques.
Contribution
The paper proposes a comprehensive framework combining data augmentation, feature fusion, and boundary learning to address class imbalance in multimodal emotion recognition.
Findings
Improved accuracy on minority classes by 10-20%.
Enhanced feature representations through multimodal fusion.
Effective handling of imbalanced emotion datasets.
Abstract
The main task of Multimodal Emotion Recognition in Conversations (MERC) is to identify the emotions in modalities, e.g., text, audio, image and video, which is a significant development direction for realizing machine intelligence. However, many data in MERC naturally exhibit an imbalanced distribution of emotion categories, and researchers ignore the negative impact of imbalanced data on emotion recognition. To tackle this problem, we systematically analyze it from three aspects: data augmentation, loss sensitivity, and sampling strategy, and propose the Class Boundary Enhanced Representation Learning (CBERL) model. Concretely, we first design a multimodal generative adversarial network to address the imbalanced distribution of {emotion} categories in raw data. Secondly, a deep joint variational autoencoder is proposed to fuse complementary semantic information across modalities and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSentiment Analysis and Opinion Mining · Emotion and Mood Recognition
MethodsGraph Neural Network
