Cracking the Code of Juxtaposition: Can AI Models Understand the Humorous Contradictions

Zhe Hu; Tuo Liang; Jing Li; Yiren Lu; Yunlai Zhou; Yiran Qiao; Jing Ma; Yu Yin

arXiv:2405.19088·cs.CL·April 16, 2026

Cracking the Code of Juxtaposition: Can AI Models Understand the Humorous Contradictions

Zhe Hu, Tuo Liang, Jing Li, Yiren Lu, Yunlai Zhou, Yiran Qiao, Jing Ma, Yu Yin

PDF

1 Datasets 1 Video

TL;DR

This paper introduces the YesBut benchmark to evaluate AI's ability to understand humorous contradictions in comics, revealing current models' limitations compared to humans.

Contribution

It presents a new benchmark for assessing AI comprehension of humorous narratives involving nonlinear and contradictory content.

Findings

01

State-of-the-art models underperform humans in understanding humorous comics.

02

The benchmark covers tasks from literal comprehension to deep narrative reasoning.

03

Current models show significant room for improvement in grasping humor nuances.

Abstract

Recent advancements in large multimodal language models have demonstrated remarkable proficiency across a wide range of tasks. Yet, these models still struggle with understanding the nuances of human humor through juxtaposition, particularly when it involves nonlinear narratives that underpin many jokes and humor cues. This paper investigates this challenge by focusing on comics with contradictory narratives, where each comic consists of two panels that create a humorous contradiction. We introduce the YesBut benchmark, which comprises tasks of varying difficulty aimed at assessing AI's capabilities in recognizing and interpreting these comics, ranging from literal content comprehension to deep narrative reasoning. Through extensive experimentation and analysis of recent commercial or open-sourced large (vision) language models, we assess their capability to comprehend the complex…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Datasets

zhehuderek/YESBUT_Benchmark
dataset· 36 dl
36 dl

Videos

Cracking the Code of Juxtaposition: Can AI Models Understand the Humorous Contradictions· slideslive