TL;DR
This paper introduces an iterative feature engineering method for speech emotion recognition that enhances model accuracy and explainability by selecting relevant features using Shapley values, outperforming existing methods.
Contribution
The paper presents a novel supervised SER approach that emphasizes explainability and iterative feature selection, improving performance and transparency over prior techniques.
Findings
Outperforms human-level performance on TESS dataset
Achieves state-of-the-art accuracy in speech emotion recognition
Balances model performance with interpretability
Abstract
In speech emotion recognition (SER), using predefined features without considering their practical importance may lead to high dimensional datasets, including redundant and irrelevant information. Consequently, high-dimensional learning often results in decreasing model accuracy while increasing computational complexity. Our work underlines the importance of carefully considering and analyzing features in order to build efficient SER systems. We present a new supervised SER method based on an efficient feature engineering approach. We pay particular attention to the explainability of results to evaluate feature relevance and refine feature sets. This is performed iteratively through feature evaluation loop, using Shapley values to boost feature selection and improve overall framework performance. Our approach allows thus to balance the benefits between model performance and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsFeature Selection
