Stimuli-Aware Visual Emotion Analysis
Jingyuan Yang, Jie Li, Xiumei Wang, Yuxuan Ding, Xinbo Gao

TL;DR
This paper introduces a stimuli-aware approach to visual emotion analysis that selects specific emotional stimuli from images and employs specialized networks to extract features, leading to improved accuracy over existing methods.
Contribution
It is the first to incorporate a stimuli selection process into VEA within an end-to-end network, utilizing psychological theory and novel hierarchical loss for better emotion recognition.
Findings
Outperforms state-of-the-art on four datasets
Effective stimuli selection improves emotion prediction accuracy
Hierarchical loss enhances model robustness and interpretability
Abstract
Visual emotion analysis (VEA) has attracted great attention recently, due to the increasing tendency of expressing and understanding emotions through images on social networks. Different from traditional vision tasks, VEA is inherently more challenging since it involves a much higher level of complexity and ambiguity in human cognitive process. Most of the existing methods adopt deep learning techniques to extract general features from the whole image, disregarding the specific features evoked by various emotional stimuli. Inspired by the \textit{Stimuli-Organism-Response (S-O-R)} emotion model in psychological theory, we proposed a stimuli-aware VEA method consisting of three stages, namely stimuli selection (S), feature extraction (O) and emotion prediction (R). First, specific emotional stimuli (i.e., color, object, face) are selected from images by employing the off-the-shelf tools.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
