From Codebooks to VLMs: Evaluating Automated Visual Discourse Analysis for Climate Change on Social Media

Katharina Prasse; Steffen Jung; Isaac Bravo; Stefanie Walter; Patrick Knab; Christian Bartelt; Margret Keuper

arXiv:2604.21786·cs.CV·April 24, 2026

From Codebooks to VLMs: Evaluating Automated Visual Discourse Analysis for Climate Change on Social Media

Katharina Prasse, Steffen Jung, Isaac Bravo, Stefanie Walter, Patrick Knab, Christian Bartelt, Margret Keuper

PDF

1 Repo

TL;DR

This paper evaluates the effectiveness of various vision-language models for analyzing climate change discourse on social media, demonstrating their potential for large-scale trend detection despite moderate per-image accuracy.

Contribution

It benchmarks multiple promptable vision-language models on climate-related social media images and advocates for distributional evaluation methods for discourse analysis.

Findings

01

Gemini-3.1-flash-lite outperforms other models across datasets.

02

Distributional evaluation reliably recovers population trends.

03

Prompt design enhances model performance.

Abstract

Social media platforms have become primary arenas for climate communication, generating millions of images and posts that - if systematically analysed - can reveal which communication strategies mobilise public concern and which fall flat. We aim to facilitate such research by analysing how computer vision methods can be used for social media discourse analysis. This analysis includes application-based taxonomy design, model selection, prompt engineering, and validation. We benchmark six promptable vision-language models and 15 zero-shot CLIP-like models on two datasets from X (formerly Twitter) - a 1,038-image expert-annotated set and a larger corpus of over 1.2 million images, with 50,000 labels manually validated - spanning five annotation dimensions: animal content, climate change consequences, climate action, image setting, and image type. Among the models benchmarked,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

KathPra/Codebooks2VLMs.git
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.