1st Place Solution to Odyssey Emotion Recognition Challenge Task1: Tackling Class Imbalance Problem
Mingjie Chen, Hezhao Zhang, Yuanchao Li, Jiachen Luo, Wen Wu, Ziyang, Ma, Peter Bell, Catherine Lai, Joshua Reiss, Lin Wang, Philip C. Woodland,, Xie Chen, Huy Phan, Thomas Hain

TL;DR
This paper presents a top-performing ensemble system for speech emotion recognition that effectively addresses class imbalance by combining focal loss with class weights and using diverse models, achieving top challenge ranking.
Contribution
The novel approach integrates focal loss weighted by class priors with ensemble voting, improving minority class recognition in imbalanced emotion recognition data.
Findings
Achieved top-1 ranking in the Odyssey 2024 challenge
Ensemble outperformed individual models in accuracy and F1 score
Focal loss with class weights improved minority class performance
Abstract
Speech emotion recognition is a challenging classification task with natural emotional speech, especially when the distribution of emotion types is imbalanced in the training and test data. In this case, it is more difficult for a model to learn to separate minority classes, resulting in those sometimes being ignored or frequently misclassified. Previous work has utilised class weighted loss for training, but problems remain as it sometimes causes over-fitting for minor classes or under-fitting for major classes. This paper presents the system developed by a multi-site team for the participation in the Odyssey 2024 Emotion Recognition Challenge Track-1. The challenge data has the aforementioned properties and therefore the presented systems aimed to tackle these issues, by introducing focal loss in optimisation when applying class weighted loss. Specifically, the focal loss is further…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVehicle License Plate Recognition
MethodsFocal Loss
