Recognizing Emotions evoked by Movies using Multitask Learning
Hassan Hayat, Carles Ventura, Agata Lapedriza

TL;DR
This paper introduces a multi-task learning approach to recognize emotions evoked by movies, modeling individual viewer emotions alongside aggregated annotations, leading to improved accuracy over existing methods.
Contribution
The paper proposes a novel multi-task deep learning architecture that jointly models individual and aggregated viewer emotions, outperforming traditional single-task models and state-of-the-art benchmarks.
Findings
Multi-task approach more accurately models individual and aggregated emotions.
Outperforms state-of-the-art on COGNIMUSE benchmark.
Demonstrates the effectiveness of joint modeling for affective movie analysis.
Abstract
Understanding the emotional impact of movies has become important for affective movie analysis, ranking, and indexing. Methods for recognizing evoked emotions are usually trained on human annotated data. Concretely, viewers watch video clips and have to manually annotate the emotions they experienced while watching the videos. Then, the common practice is to aggregate the different annotations, by computing average scores or majority voting, and train and test models on these aggregated annotations. With this procedure a single aggregated evoked emotion annotation is obtained per each video. However, emotions experienced while watching a video are subjective: different individuals might experience different emotions. In this paper, we model the emotions evoked by videos in a different manner: instead of modeling the aggregated value we jointly model the emotions experienced by each…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEmotion and Mood Recognition · Human Pose and Action Recognition · Video Analysis and Summarization
