The phonetic bases of vocal expressed emotion: natural versus acted
Hira Dhamyal, Shahan Ali Memon, Bhiksha Raj, Rita Singh

TL;DR
This study investigates the phonetic differences between natural and acted vocal emotions using a self-attention model, revealing significant discrepancies that question the validity of acted databases for emotion recognition.
Contribution
It introduces a systematic comparison of natural and acted vocal emotions at the phonetic level using a self-attention based classification approach.
Findings
Significant phonetic differences between natural and acted emotions.
Acted speech databases have moderate to low validity for emotion classification.
Natural emotions show distinct phonetic patterns compared to acted emotions.
Abstract
Can vocal emotions be emulated? This question has been a recurrent concern of the speech community, and has also been vigorously investigated. It has been fueled further by its link to the issue of validity of acted emotion databases. Much of the speech and vocal emotion research has relied on acted emotion databases as valid proxies for studying natural emotions. To create models that generalize to natural settings, it is crucial to work with valid prototypes -- ones that can be assumed to reliably represent natural emotions. More concretely, it is important to study emulated emotions against natural emotions in terms of their physiological, and psychological concomitants. In this paper, we present an on-scale systematic study of the differences between natural and acted vocal emotions. We use a self-attention based emotion classification model to understand the phonetic bases of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
