SOLVER: Scene-Object Interrelated Visual Emotion Reasoning Network
Jingyuan Yang, Xinbo Gao, Leida Li, Xiumei Wang, and Jinshan Ding

TL;DR
The paper introduces SOLVER, a novel network that models interactions between objects and scenes to improve visual emotion analysis, outperforming existing methods and providing interpretability.
Contribution
It proposes a scene-object interrelated reasoning network using an Emotion Graph and scene-based attention, advancing the accuracy and interpretability of visual emotion analysis.
Findings
Outperforms state-of-the-art methods on eight datasets
Demonstrates robustness across additional datasets
Provides interpretability through visualization
Abstract
Visual Emotion Analysis (VEA) aims at finding out how people feel emotionally towards different visual stimuli, which has attracted great attention recently with the prevalence of sharing images on social networks. Since human emotion involves a highly complex and abstract cognitive process, it is difficult to infer visual emotions directly from holistic or regional features in affective images. It has been demonstrated in psychology that visual emotions are evoked by the interactions between objects as well as the interactions between objects and scenes within an image. Inspired by this, we propose a novel Scene-Object interreLated Visual Emotion Reasoning network (SOLVER) to predict emotions from images. To mine the emotional relationships between distinct objects, we first build up an Emotion Graph based on semantic concepts and visual features. Then, we conduct reasoning on the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
