Absurd World: A Simple Yet Powerful Method to Absurdify the Real-world for Probing LLM Reasoning Capabilities

Ryan Albright; Golam Md Muktadir; Zarif Ikram; S M Jubaer; Mehrab Hossain; Dianbo Liu

arXiv:2605.09678·cs.AI·May 12, 2026

Absurd World: A Simple Yet Powerful Method to Absurdify the Real-world for Probing LLM Reasoning Capabilities

Ryan Albright, Golam Md Muktadir, Zarif Ikram, S M Jubaer, Mehrab Hossain, Dianbo Liu

PDF

TL;DR

Absurd World is a benchmarking framework that tests LLM reasoning by creating absurd scenarios with preserved logic, revealing their ability to think logically beyond learned real-world patterns.

Contribution

The paper introduces Absurd World, a novel benchmarking method for evaluating LLM reasoning robustness using logically coherent absurd scenarios.

Findings

01

LLMs struggle with absurd scenarios despite logical coherence.

02

Prompting techniques influence LLM performance on absurd tasks.

03

Absurd World effectively isolates reasoning ability from real-world pattern learning.

Abstract

While extremely powerful and versatile at various tasks, the thinking capabilities of large language models (LLMs) are often put under scrutiny as they sometimes fail to solve problems that humans can systematically solve. However, recent literature focuses on breaking LLM reasoning with increasingly complex problems, and whether an LLM is robust in simple logical reasoning remains underexplored. This paper proposes Absurd World, a benchmarking framework, to test LLMs against altered realism, where scenarios are logically coherent, and humans can easily solve the tasks. Absurd World breaks a real-world model into symbols, actions, sequences, and events, which are automatically altered to create absurd worlds where the logic to solve the tasks remains the same. It evaluates a large collection of models with simple and advanced prompting techniques, and proves that it is an effective tool…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.