Detection and Analysis of Emotion From Speech Signals
Assel Davletcharova, Sherin Sugathan, Bibia Abraham, Alex Pappachen, James

TL;DR
This study explores recognizing emotions like neutral, anger, joy, and sadness from speech signals, emphasizing the importance of individual-specific data for improved accuracy in emotion classification.
Contribution
It introduces a method for emotion recognition from speech using peak-to-peak distance features and compares classifiers on a custom dataset from multiple subjects.
Findings
Individual data yields higher classification accuracy.
Peak-to-peak distance is a significant feature for emotion detection.
Classifier performance varies with different emotional states.
Abstract
Recognizing emotion from speech has become one the active research themes in speech processing and in applications based on human-computer interaction. This paper conducts an experimental study on recognizing emotions from human speech. The emotions considered for the experiments include neutral, anger, joy and sadness. The distinuishability of emotional features in speech were studied first followed by emotion classification performed on a custom dataset. The classification was performed for different classifiers. One of the main feature attribute considered in the prepared dataset was the peak-to-peak distance obtained from the graphical representation of the speech signals. After performing the classification tests on a dataset formed from 30 different subjects, it was found that for getting better accuracy, one should consider the data collected from one person rather than…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
