Speech Emotion Recognition Using CNN and Its Use Case in Digital Healthcare
Nishargo Nigar

TL;DR
This paper presents a CNN-based approach for speech emotion recognition aimed at enhancing digital healthcare by accurately identifying emotional states from voice data, with a focus on improving human-AI interaction.
Contribution
It introduces a novel CNN model for emotion detection from speech and explores its application in digital healthcare contexts.
Findings
High precision, recall, and F1 scores achieved
Effective identification of emotions from audio recordings
Potential for improved human-AI communication in healthcare
Abstract
The process of identifying human emotion and affective states from speech is known as speech emotion recognition (SER). This is based on the observation that tone and pitch in the voice frequently convey underlying emotion. Speech recognition includes the ability to recognize emotions, which is becoming increasingly popular and in high demand. With the help of appropriate factors (such modalities, emotions, intensities, repetitions, etc.) found in the data, my research seeks to use the Convolutional Neural Network (CNN) to distinguish emotions from audio recordings and label them in accordance with the range of different emotions. I have developed a machine learning model to identify emotions from supplied audio files with the aid of machine learning methods. The evaluation is mostly focused on precision, recall, and F1 score, which are common machine learning metrics. To properly set…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsInnovation in Digital Healthcare Systems · Education and Learning Interventions
MethodsSparse Evolutionary Training
