Empirical Interpretation of the Relationship Between Speech Acoustic Context and Emotion Recognition
Anna Ollerenshaw, Md Asif Jalal, Rosanna Milner, Thomas Hain

TL;DR
This paper investigates how acoustic context and phone boundaries influence speech emotion recognition, proposing an attention-based distributed approach that aligns with psycholinguistic theories and improves understanding across different speech datasets.
Contribution
It introduces an attention-based method that models emotion as a distributed over continuous time-windows, bridging psycholinguistic insights with computational SER models.
Findings
Distributed approach enhances cross-corpora SER performance
Attention vectors reveal overlapping distributions of phonemes and emotion
Using fundamental frequency improves emotion localization
Abstract
Speech emotion recognition (SER) is vital for obtaining emotional intelligence and understanding the contextual meaning of speech. Variations of consonant-vowel (CV) phonemic boundaries can enrich acoustic context with linguistic cues, which impacts SER. In practice, speech emotions are treated as single labels over an acoustic segment for a given time duration. However, phone boundaries within speech are not discrete events, therefore the perceived emotion state should also be distributed over potentially continuous time-windows. This research explores the implication of acoustic context and phone boundaries on local markers for SER using an attention-based approach. The benefits of using a distributed approach to speech emotion understanding are supported by the results of cross-corpora analysis experiments. Experiments where phones and words are mapped to the attention vectors…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEmotion and Mood Recognition
