A Multi-component CNN-RNN Approach for Dimensional Emotion Recognition   in-the-wild

Dimitrios Kollias; Stefanos Zafeiriou

arXiv:1805.01452·cs.CV·December 17, 2019·27 cites

A Multi-component CNN-RNN Approach for Dimensional Emotion Recognition in-the-wild

Dimitrios Kollias, Stefanos Zafeiriou

PDF

Open Access

TL;DR

This paper introduces a multi-component CNN-RNN deep learning model for dimensional emotion recognition in videos, achieving improved accuracy on the OMG-Emotion Challenge dataset.

Contribution

It develops an extended CNN-RNN architecture that combines multiple features for better emotion dimension estimation in-the-wild videos.

Findings

01

Achieved best performance on valence and arousal estimation tasks.

02

Optimized architecture for the OMG-Emotion validation dataset.

03

Demonstrated effectiveness of multi-feature CNN-RNN approach.

Abstract

This paper presents our approach to the One-Minute Gradual-Emotion Recognition (OMG-Emotion) Challenge, focusing on dimensional emotion recognition through visual analysis of the provided emotion videos. The approach is based on a Convolutional and Recurrent (CNN-RNN) deep neural architecture we have developed for the relevant large AffWild Emotion Database. We extended and adapted this architecture, by letting a combination of multiple features generated in the CNN component be explored by RNN subnets. Our target has been to obtain best performance on the OMG-Emotion visual validation data set, while learning the respective visual training data set. Extended experimentation has led to best architectures for the estimation of the values of the valence and arousal emotion dimensions over these data sets.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsEmotion and Mood Recognition · Human Pose and Action Recognition · Face and Expression Recognition