Uncertainty-Aware Multimodal Emotion Recognition through Dirichlet Parameterization
R\'emi Grzeczkowicz, Eric Soriano, Ali Janati, Miyu Zhang, Gerard Comas-Quiles, Victor Carballo Araruna, Aneesh Jonelagadda

TL;DR
This paper introduces a modular, efficient, and privacy-preserving multimodal emotion recognition framework that captures uncertainty using Dirichlet parameterization, suitable for deployment on edge devices and applicable to various modalities.
Contribution
The work presents a novel uncertainty-aware fusion mechanism based on Dempster-Shafer theory and Dirichlet evidence, enabling robust multimodal emotion recognition without additional training.
Findings
Achieves competitive accuracy on five benchmark datasets.
Demonstrates robustness to ambiguous or missing inputs.
Operates efficiently on edge devices with modular design.
Abstract
In this work, we present a lightweight and privacy-preserving Multimodal Emotion Recognition (MER) framework designed for deployment on edge devices. To demonstrate framework's versatility, our implementation uses three modalities - speech, text and facial imagery. However, the system is fully modular, and can be extended to support other modalities or tasks. Each modality is processed through a dedicated backbone optimized for inference efficiency: Emotion2Vec for speech, a ResNet-based model for facial expressions, and DistilRoBERTa for text. To reconcile uncertainty across modalities, we introduce a model- and task-agnostic fusion mechanism grounded in Dempster-Shafer theory and Dirichlet evidence. Operating directly on model logits, this approach captures predictive uncertainty without requiring additional training or joint distribution estimation, making it broadly applicable…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEmotion and Mood Recognition · Sentiment Analysis and Opinion Mining · Face and Expression Recognition
