World model inspired sarcasm reasoning with large language model agents

Keito Inoshita; Shinnosuke Mizuno

arXiv:2512.24329·cs.CL·May 1, 2026

World model inspired sarcasm reasoning with large language model agents

Keito Inoshita, Shinnosuke Mizuno

PDF

TL;DR

This paper introduces WM-SAR, a novel sarcasm detection method that uses specialized LLM-based agents to explicitly model and quantify semantic inconsistency and intention, improving accuracy and interpretability.

Contribution

The work reformulates sarcasm understanding as a world model inspired reasoning process with explicit component decomposition and a lightweight inference model, advancing interpretability and performance.

Findings

01

WM-SAR outperforms existing methods on sarcasm detection benchmarks.

02

Explicit modeling of semantic inconsistency and intention improves detection accuracy.

03

Ablation studies confirm the importance of component integration for effectiveness.

Abstract

Sarcasm understanding is a challenging problem in natural language processing, as it requires capturing the discrepancy between the surface meaning of an utterance and the speaker's intentions as well as the surrounding social context. Although recent advances in deep learning and Large Language Models (LLMs) have substantially improved performance, most existing approaches still rely on black-box predictions of a single model, making it difficult to structurally explain the cognitive factors underlying sarcasm. Moreover, while sarcasm often emerges as a mismatch between semantic evaluation and normative expectations or intentions, frameworks that explicitly decompose and model these components remain limited. In this work, we reformulate sarcasm understanding as a world model inspired reasoning process and propose World Model inspired SArcasm Reasoning (WM-SAR), which decomposes…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.