Automatically augmenting an emotion dataset improves classification   using audio

Egor Lakomkin; Cornelius Weber; Stefan Wermter

arXiv:1803.11506·cs.CL·April 2, 2018

Automatically augmenting an emotion dataset improves classification using audio

Egor Lakomkin, Cornelius Weber, Stefan Wermter

PDF

TL;DR

This paper presents a method to automatically augment speech emotion datasets by extracting audio from movies using sentiment analysis, improving emotion classification performance on the EmotiW corpus.

Contribution

It introduces a novel approach combining textual sentiment analysis with audio data extraction to enhance emotion dataset size and classifier accuracy.

Findings

01

Pretraining on augmented dataset improves classification accuracy.

02

Combining textual sentiment analysis with audio data benefits emotion recognition.

03

Method enhances generalization in speech emotion classification.

Abstract

In this work, we tackle a problem of speech emotion classification. One of the issues in the area of affective computation is that the amount of annotated data is very limited. On the other hand, the number of ways that the same emotion can be expressed verbally is enormous due to variability between speakers. This is one of the factors that limits performance and generalization. We propose a simple method that extracts audio samples from movies using textual sentiment analysis. As a result, it is possible to automatically construct a larger dataset of audio samples with positive, negative emotional and neutral speech. We show that pretraining recurrent neural network on such a dataset yields better results on the challenging EmotiW corpus. This experiment shows a potential benefit of combining textual sentiment analysis with vocal information.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.