"We care": Improving Code Mixed Speech Emotion Recognition in Customer-Care Conversations
N V S Abhishek, Pushpak Bhattacharyya

TL;DR
This paper introduces NSED, a natural code-mixed speech emotion dataset for customer-care conversations, and demonstrates that incorporating word-level VAD values improves emotion recognition accuracy, especially for negative emotions.
Contribution
The creation of NSED, a natural, code-mixed speech emotion dataset with VAD annotations, and showing that word-level VAD integration enhances SER performance in noisy, real-world scenarios.
Findings
Incorporating word-level VAD improves negative emotion recognition by 2%.
NSED dataset captures natural code-mixed speech in customer care contexts.
Enhanced SER accuracy can lead to more empathetic conversational agents.
Abstract
Speech Emotion Recognition (SER) is the task of identifying the emotion expressed in a spoken utterance. Emotion recognition is essential in building robust conversational agents in domains such as law, healthcare, education, and customer support. Most of the studies published on SER use datasets created by employing professional actors in a noise-free environment. In natural settings such as a customer care conversation, the audio is often noisy with speakers regularly switching between different languages as they see fit. We have worked in collaboration with a leading unicorn in the Conversational AI sector to develop Natural Speech Emotion Dataset (NSED). NSED is a natural code-mixed speech emotion dataset where each utterance in a conversation is annotated with emotion, sentiment, valence, arousal, and dominance (VAD) values. In this paper, we show that by incorporating word-level…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech Recognition and Synthesis · Sentiment Analysis and Opinion Mining · Emotion and Mood Recognition
MethodsIs Venmo Customer Support Available 24/7? How to Reach a Real Person
