Semi-supervised Bayesian Deep Multi-modal Emotion Recognition
Changde Du, Changying Du, Jinpeng Li, Wei-long Zheng, Bao-liang Lu,, Huiguang He

TL;DR
This paper introduces a semi-supervised Bayesian deep multi-modal emotion recognition model that effectively leverages both labeled and unlabeled multi-modal data, overcoming data scarcity and modality limitations.
Contribution
It presents a novel multi-view deep generative model with a Gaussian mixture prior, extending to semi-supervised learning for emotion recognition from multiple modalities.
Findings
Outperforms existing methods on two real datasets
Learns modality weights automatically
Robust and flexible in handling limited labeled data
Abstract
In emotion recognition, it is difficult to recognize human's emotional states using just a single modality. Besides, the annotation of physiological emotional data is particularly expensive. These two aspects make the building of effective emotion recognition model challenging. In this paper, we first build a multi-view deep generative model to simulate the generative process of multi-modality emotional data. By imposing a mixture of Gaussians assumption on the posterior approximation of the latent variables, our model can learn the shared deep representation from multiple modalities. To solve the labeled-data-scarcity problem, we further extend our multi-view model to semi-supervised learning scenario by casting the semi-supervised classification problem as a specialized missing data imputation task. Our semi-supervised multi-view deep generative framework can leverage both labeled and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEmotion and Mood Recognition · Generative Adversarial Networks and Image Synthesis · Human Pose and Action Recognition
