MDAN: Multi-level Dependent Attention Network for Visual Emotion Analysis
Liwen Xu, Zhengtao Wang, Bin Wu, Simon Lui

TL;DR
This paper introduces MDAN, a hierarchical deep learning framework with attention modules for visual emotion analysis, effectively bridging the affective gap and achieving state-of-the-art results on multiple benchmarks.
Contribution
The paper proposes a novel multi-level dependent attention network that models emotion hierarchy and semantic-affective mapping for improved visual emotion analysis.
Findings
Achieves new state-of-the-art performance on six VEA benchmarks.
Outperforms existing methods by up to 3.85% accuracy.
Effectively models emotion hierarchy and semantic-affective relationships.
Abstract
Visual Emotion Analysis (VEA) is attracting increasing attention. One of the biggest challenges of VEA is to bridge the affective gap between visual clues in a picture and the emotion expressed by the picture. As the granularity of emotions increases, the affective gap increases as well. Existing deep approaches try to bridge the gap by directly learning discrimination among emotions globally in one shot without considering the hierarchical relationship among emotions at different affective levels and the affective level of emotions to be classified. In this paper, we present the Multi-level Dependent Attention Network (MDAN) with two branches, to leverage the emotion hierarchy and the correlation between different affective levels and semantic levels. The bottom-up branch directly learns emotions at the highest affective level and strictly follows the emotion hierarchy while predicting…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Computing and Algorithms · Sentiment Analysis and Opinion Mining
MethodsDense Connections · Sigmoid Activation · Max Pooling · Average Pooling
