TL;DR
Affect2MM is a novel method that models temporal emotion causality in multimedia content using attention and Granger causality, improving emotion prediction accuracy across multiple datasets.
Contribution
It introduces a new affective modeling approach combining emotion causality theories with attention-based and Granger causality methods for multimedia analysis.
Findings
Achieved 10-15% performance improvement over state-of-the-art methods.
Effectively models temporal emotion causality in multimedia content.
Demonstrated robustness across three diverse datasets.
Abstract
We present Affect2MM, a learning method for time-series emotion prediction for multimedia content. Our goal is to automatically capture the varying emotions depicted by characters in real-life human-centric situations and behaviors. We use the ideas from emotion causation theories to computationally model and determine the emotional state evoked in clips of movies. Affect2MM explicitly models the temporal causality using attention-based methods and Granger causality. We use a variety of components like facial features of actors involved, scene understanding, visual aesthetics, action/situation description, and movie script to obtain an affective-rich representation to understand and perceive the scene. We use an LSTM-based learning model for emotion perception. To evaluate our method, we analyze and compare our performance on three datasets, SENDv1, MovieGraphs, and the LIRIS-ACCEDE…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
