Are you sure? Analysing Uncertainty Quantification Approaches for   Real-world Speech Emotion Recognition

Oliver Schr\"ufer; Manuel Milling; Felix Burkhardt; Florian Eyben,; Bj\"orn Schuller

arXiv:2407.01143·cs.SD·July 2, 2024

Are you sure? Analysing Uncertainty Quantification Approaches for Real-world Speech Emotion Recognition

Oliver Schr\"ufer, Manuel Milling, Felix Burkhardt, Florian Eyben,, Bj\"orn Schuller

PDF

Open Access 1 Repo

TL;DR

This paper evaluates various uncertainty quantification methods for speech emotion recognition, emphasizing their effectiveness in real-world conditions like corrupted signals and out-of-distribution data, and highlights simple methods' potential.

Contribution

The study provides an empirical assessment of UQ approaches for SER under real-world challenges, demonstrating the benefits of training with OOD data for better uncertainty detection.

Findings

01

Simple UQ methods can indicate prediction uncertainty

02

Training with OOD data improves uncertainty detection

03

UQ methods help identify faulty predictions in SER

Abstract

Uncertainty Quantification (UQ) is an important building block for the reliable use of neural networks in real-world scenarios, as it can be a useful tool in identifying faulty predictions. Speech emotion recognition (SER) models can suffer from particularly many sources of uncertainty, such as the ambiguity of emotions, Out-of-Distribution (OOD) data or, in general, poor recording conditions. Reliable UQ methods are thus of particular interest as in many SER applications no prediction is better than a faulty prediction. While the effects of label ambiguity on uncertainty are well documented in the literature, we focus our work on an evaluation of UQ methods for SER under common challenges in real-world application, such as corrupted signals, and the absence of speech. We show that simple UQ methods can already give an indication of the uncertainty of a prediction and that training with…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

audeering/ser-uncertainty-quantification
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech and Audio Processing · Speech Recognition and Synthesis

MethodsFocus