TL;DR
This paper introduces an explainable, iterative feature boosting method for speech emotion recognition that improves accuracy by selecting relevant features and enhancing model transparency, validated on multiple benchmark datasets.
Contribution
It presents the first integration of model explainability into an SER framework, using Shapley values for iterative feature refinement to boost performance and interpretability.
Findings
Outperforms state-of-the-art methods on multiple SER benchmarks
Effectively identifies and removes irrelevant features
Enhances model transparency and understanding
Abstract
Speech emotion recognition (SER) has gained significant attention due to its several application fields, such as mental health, education, and human-computer interaction. However, the accuracy of SER systems is hindered by high-dimensional feature sets that may contain irrelevant and redundant information. To overcome this challenge, this study proposes an iterative feature boosting approach for SER that emphasizes feature relevance and explainability to enhance machine learning model performance. Our approach involves meticulous feature selection and analysis to build efficient SER systems. In addressing our main problem through model explainability, we employ a feature evaluation loop with Shapley values to iteratively refine feature sets. This process strikes a balance between model performance and transparency, which enables a comprehensive understanding of the model's predictions.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsSparse Evolutionary Training · Feature Selection
