Visual Affect Analysis: Predicting Emotions of Image Viewers with Vision-Language Models
Filip Nowicki, Hubert Marciniak, Jakub {\L}\k{a}czkowski, Krzysztof Jassem, Tomasz G\'orecki, Vimala Balakrishnan, Desmond C. Ong, Maciej Behnke

TL;DR
This study benchmarks vision-language models on affective image datasets, showing they can predict broad emotional trends but lack nuanced accuracy compared to human ratings, with implications for affective computing.
Contribution
It provides a comprehensive evaluation of nine VLMs on validated affective datasets, highlighting their strengths and limitations in emotion prediction tasks.
Findings
Models achieved 60-80% accuracy in discrete emotion classification.
Predictions of anger and surprise were less accurate.
Moderate to strong correlation (r > 0.75) with human ratings in continuous predictions.
Abstract
Vision-language models (VLMs) show promise as tools for inferring affect from visual stimuli at scale; it is not yet clear how closely their outputs align with human affective ratings. We benchmarked nine VLMs, ranging from state-of-the-art proprietary models to open-source models, on three psycho-metrically validated affective image datasets: the International Affective Picture System, the Nencki Affective Picture System, and the Library of AI-Generated Affective Images. The models performed two tasks in the zero-shot setting: (i) top-emotion classification (selecting the strongest discrete emotion elicited by an image) and (ii) continuous prediction of human ratings on 1-7/9 Likert scales for discrete emotion categories and affective dimensions. We also evaluated the impact of rater-conditioned prompting on the LAI-GAI dataset using de-identified participant metadata. The results show…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEmotion and Mood Recognition · Sentiment Analysis and Opinion Mining · Emotions and Moral Behavior
