Transfer Learning for Improving Speech Emotion Classification Accuracy

Siddique Latif; Rajib Rana; Shahzad Younis; Junaid Qadir; and Julien; Epps

arXiv:1801.06353·cs.CV·July 29, 2020·6 cites

Transfer Learning for Improving Speech Emotion Classification Accuracy

Siddique Latif, Rajib Rana, Shahzad Younis, Junaid Qadir, and Julien, Epps

PDF

Open Access 1 Repo

TL;DR

This paper introduces a transfer learning approach using Deep Belief Networks to enhance speech emotion recognition accuracy across different languages and corpora, addressing cross-corpus and cross-language challenges.

Contribution

It presents a novel application of transfer learning with DBNs for cross-language and cross-corpus speech emotion recognition, outperforming previous methods.

Findings

01

DBNs outperform previous approaches in cross-corpus recognition

02

Using multiple languages for training improves accuracy

03

Limited target data can still yield high accuracy with transfer learning

Abstract

The majority of existing speech emotion recognition research focuses on automatic emotion detection using training and testing data from same corpus collected under the same conditions. The performance of such systems has been shown to drop significantly in cross-corpus and cross-language scenarios. To address the problem, this paper exploits a transfer learning technique to improve the performance of speech emotion recognition systems that is novel in cross-language and cross-corpus scenarios. Evaluations on five different corpora in three different languages show that Deep Belief Networks (DBNs) offer better accuracy than previous approaches on cross-corpus emotion recognition, relative to a Sparse Autoencoder and SVM baseline system. Results also suggest that using a large number of languages for training and using a small fraction of the target data in training can significantly…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

raulsteleac/Speech_Emotion_Recognition
tf

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsEmotion and Mood Recognition · Sentiment Analysis and Opinion Mining · Speech Recognition and Synthesis

MethodsSparse Autoencoder · Solana Customer Service Number +1-833-534-1729 · Support Vector Machine