MemeReaCon: Probing Contextual Meme Understanding in Large Vision-Language Models

Zhengyi Zhao; Shubo Zhang; Yuxi Zhang; Yanxi Zhao; Yifan Zhang; Zezhong Wang; Huimin Wang; Yutian Zhao; Bin Liang; Yefeng Zheng; Binyang Li; Kam-Fai Wong; Xian Wu

arXiv:2505.17433 · cs.AI · June 5, 2025

TL;DR

MemeReaCon introduces a benchmark to evaluate large vision-language models' ability to understand memes within their original conversational context, highlighting current models' limitations in interpreting context-dependent meme intent.

Contribution

We created MemeReaCon, a new benchmark dataset that assesses how well LVLMs understand memes in context, addressing a key gap in current multimodal understanding research.

Findings

01. LVLMs struggle with context-dependent meme interpretation.

02. Models tend to focus on visual details over communicative intent.

03. MemeReaCon exposes significant limitations in current LVLMs' contextual understanding.

Abstract

Memes have emerged as a popular form of multimodal online communication whose interpretation depends heavily on the specific context in which they appear. Current approaches predominantly analyze memes in isolation, either for harmful content detection or for standalone interpretation, and overlook a fundamental challenge: the same meme can express different intents depending on its conversational context. This oversight creates an evaluation gap: although humans intuitively recognize how context shapes meme interpretation, Large Vision-Language Models (LVLMs) struggle to understand context-dependent meme intent. To address this limitation, we introduce MemeReaCon, a novel benchmark specifically designed to evaluate how LVLMs understand memes in their original context. We collected memes from five different Reddit communities, keeping each meme's image, the post text, and…
