Enhancing the Prediction of Emotional Experience in Movies using Deep Neural Networks: The Significance of Audio and Language
Sogand Mehrpour Mohammadi, Meysam Gouran Orimi, Hamidreza Rabiee

TL;DR
This study employs deep neural networks to predict movie-induced emotions by integrating visual, auditory, and linguistic data, revealing the significant roles of language and sound in emotional prediction.
Contribution
It is the first comprehensive model combining all three modalities—visual, audio, and language—for emotion prediction in movies, demonstrating their relative importance.
Findings
Language significantly influences arousal prediction.
Sound is the primary determinant for valence.
Visual cues have the least impact on emotion prediction.
Abstract
Our paper focuses on making use of deep neural network models to accurately predict the range of human emotions experienced during watching movies. In this certain setup, there exist three clear-cut input modalities that considerably influence the experienced emotions: visual cues derived from RGB video frames, auditory components encompassing sounds, speech, and music, and linguistic elements encompassing actors' dialogues. Emotions are commonly described using a two-factor model including valence (ranging from happy to sad) and arousal (indicating the intensity of the emotion). In this regard, a Plethora of works have presented a multitude of models aiming to predict valence and arousal from video content. However, non of these models contain all three modalities, with language being consistently eliminated across all of them. In this study, we comprehensively combine all modalities…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage and Signal Denoising Methods · Music and Audio Processing · Media Influence and Health
MethodsNetwork On Network
