Memes-as-Replies: Can Models Select Humorous Manga Panel Responses?
Ryosuke Kohita, Seiichiro Yoshioka

TL;DR
This paper introduces a new benchmark for selecting humorous manga panel responses in social media conversations, revealing current models' limitations in understanding humor and visual context.
Contribution
The paper presents the Meme Reply Selection task and MaMe-Re benchmark, providing a large dataset and analysis of model performance in humor understanding.
Findings
LLMs can capture social cues like exaggeration.
Visual information does not enhance model performance.
Models struggle to differentiate subtle wit in responses.
Abstract
Memes are a popular element of modern web communication, used not only as static artifacts but also as interactive replies within conversations. While computational research has focused on analyzing the intrinsic properties of memes, the dynamic and contextual use of memes to create humor remains an understudied area of web science. To address this gap, we introduce the Meme Reply Selection task and present MaMe-Re (Manga Meme Reply Benchmark), a benchmark of 100,000 human-annotated pairs (500,000 total annotations from 2,325 unique annotators) consisting of openly licensed Japanese manga panels and social media posts. Our analysis reveals three key insights: (1) large language models (LLMs) show preliminary evidence of capturing complex social cues such as exaggeration, moving beyond surface-level semantic matching; (2) the inclusion of visual information does not improve performance,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHumor Studies and Applications · Language, Metaphor, and Cognition · Media Influence and Health
