Dynamic Graph Neural ODE Network for Multi-modal Emotion Recognition in Conversation
Yuntao Shou, Tao Meng, Wei Ai, Keqin Li

TL;DR
This paper introduces DGODE, a novel dynamic graph neural ODE network that captures temporal dependencies and reduces overfitting in multimodal emotion recognition, leading to improved performance and deeper GCNs.
Contribution
The paper proposes DGODE, integrating adaptive mixhop and graph ODE to model continuous emotion dynamics and mitigate overfitting in multimodal MERC.
Findings
DGODE outperforms baseline models on two datasets.
It effectively alleviates overfitting and over-smoothing.
Enables construction of deeper GCN networks.
Abstract
Multimodal emotion recognition in conversation (MERC) refers to identifying and classifying human emotional states by combining data from multiple different modalities (e.g., audio, images, text, video, etc.). Most existing multimodal emotion recognition methods use GCN to improve performance, but existing GCN methods are prone to overfitting and cannot capture the temporal dependency of the speaker's emotions. To address the above problems, we propose a Dynamic Graph Neural Ordinary Differential Equation Network (DGODE) for MERC, which combines the dynamic changes of emotions to capture the temporal dependency of speakers' emotions, and effectively alleviates the overfitting problem of GCNs. Technically, the key idea of DGODE is to utilize an adaptive mixhop mechanism to improve the generalization ability of GCNs and use the graph ODE evolution network to characterize the continuous…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEmotion and Mood Recognition
MethodsGraph Convolutional Network
