Analyzing Image Beyond Visual Aspect: Image Emotion Classification via Multiple-Affective Captioning
Zibo Zhou, Zhengjun Zhai, Huimin Chen, Wei Dai, Hansen Yang

TL;DR
This paper introduces a novel affective captioning approach for image emotion classification that leverages language to bridge the affective gap, utilizing hierarchical contrastive loss and emotional reasoning to improve accuracy.
Contribution
The paper proposes a new method combining affective captioning, contrastive learning, and language models to enhance image emotion classification beyond visual features.
Findings
Achieves superior results on multiple benchmarks.
Effectively bridges the affective gap in image emotion classification.
Incorporates embedded text in images for improved analysis.
Abstract
Image emotion classification (IEC) is a longstanding research field that has received increasing attention with the rapid progress of deep learning. Although recent advances have leveraged the knowledge encoded in pre-trained visual models, their effectiveness is constrained by the "affective gap" , limits the applicability of pre-training knowledge for IEC tasks. It has been demonstrated in psychology that language exhibits high variability, encompasses diverse and abundant information, and can effectively eliminate the "affective gap". Inspired by this, we propose a novel Affective Captioning for Image Emotion Classification (ACIEC) to classify image emotion based on pure texts, which effectively capture the affective information in the image. In our method, a hierarchical multi-level contrastive loss is designed for detecting emotional concepts from images, while an emotional…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSentiment Analysis and Opinion Mining · Multimodal Machine Learning Applications · Emotion and Mood Recognition
