Transfer Learning for Improving Speech Emotion Classification Accuracy
Siddique Latif, Rajib Rana, Shahzad Younis, Junaid Qadir, and Julien, Epps

TL;DR
This paper introduces a transfer learning approach using Deep Belief Networks to enhance speech emotion recognition accuracy across different languages and corpora, addressing cross-corpus and cross-language challenges.
Contribution
It presents a novel application of transfer learning with DBNs for cross-language and cross-corpus speech emotion recognition, outperforming previous methods.
Findings
DBNs outperform previous approaches in cross-corpus recognition
Using multiple languages for training improves accuracy
Limited target data can still yield high accuracy with transfer learning
Abstract
The majority of existing speech emotion recognition research focuses on automatic emotion detection using training and testing data from same corpus collected under the same conditions. The performance of such systems has been shown to drop significantly in cross-corpus and cross-language scenarios. To address the problem, this paper exploits a transfer learning technique to improve the performance of speech emotion recognition systems that is novel in cross-language and cross-corpus scenarios. Evaluations on five different corpora in three different languages show that Deep Belief Networks (DBNs) offer better accuracy than previous approaches on cross-corpus emotion recognition, relative to a Sparse Autoencoder and SVM baseline system. Results also suggest that using a large number of languages for training and using a small fraction of the target data in training can significantly…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEmotion and Mood Recognition · Sentiment Analysis and Opinion Mining · Speech Recognition and Synthesis
MethodsSparse Autoencoder · Solana Customer Service Number +1-833-534-1729 · Support Vector Machine
