End-to-End Label Uncertainty Modeling in Speech Emotion Recognition using Bayesian Neural Networks and Label Distribution Learning
Navin Raj Prabhu, Nale Lehmann-Willenbrock, Timo Gerkman

TL;DR
This paper introduces a Bayesian neural network approach that models label uncertainty in speech emotion recognition by using Student's t-distribution, capturing subjectivity and improving uncertainty estimation.
Contribution
It proposes an end-to-end method using Student's t-distribution for better modeling of annotation subjectivity in speech emotion recognition.
Findings
Achieves state-of-the-art uncertainty modeling results.
Performs well in cross-corpora evaluations.
T-distribution's advantage increases with more annotator correlation and fewer annotations.
Abstract
To train machine learning algorithms to predict emotional expressions in terms of arousal and valence, annotated datasets are needed. However, as different people perceive others' emotional expressions differently, their annotations are subjective. To account for this, annotations are typically collected from multiple annotators and averaged to obtain ground-truth labels. However, when exclusively trained on this averaged ground-truth, the model is agnostic to the inherent subjectivity in emotional expressions. In this work, we therefore propose an end-to-end Bayesian neural network capable of being trained on a distribution of annotations to also capture the subjectivity-based label uncertainty. Instead of a Gaussian, we model the annotation distribution using Student's t-distribution, which also accounts for the number of annotations available. We derive the corresponding…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEmotion and Mood Recognition · Speech Recognition and Synthesis · Sentiment Analysis and Opinion Mining
