Flying Pigs, FaR and Beyond: Evaluating LLM Reasoning in Counterfactual Worlds

Anish R Joishy; Ishwar B Balappanawar; Vamshi Krishna Bonagiri; Manas Gaur; Krishnaprasad Thirunarayan; Ponnurangam Kumaraguru

arXiv:2505.22318·cs.CL·March 25, 2026

Flying Pigs, FaR and Beyond: Evaluating LLM Reasoning in Counterfactual Worlds

Anish R Joishy, Ishwar B Balappanawar, Vamshi Krishna Bonagiri, Manas Gaur, Krishnaprasad Thirunarayan, Ponnurangam Kumaraguru

PDF

Open Access

TL;DR

This paper evaluates how well large language models reason in counterfactual scenarios where their knowledge conflicts with the context, introduces a new benchmark, and proposes a metacognitive intervention to improve reasoning accuracy.

Contribution

It introduces CounterLogic, a benchmark for testing LLM reasoning in counterfactual worlds, and proposes Flag & Reason (FaR), a simple intervention that enhances model robustness.

Findings

01

LLMs' accuracy drops by 14% in counterfactual scenarios.

02

Flag & Reason reduces the performance gap to 7%.

03

Metacognitive prompting improves overall reasoning accuracy.

Abstract

A fundamental challenge in reasoning is navigating hypothetical, counterfactual worlds where logic may conflict with ingrained knowledge. We investigate this frontier for Large Language Models (LLMs) by asking: Can LLMs reason logically when the context contradicts their parametric knowledge? To facilitate a systematic analysis, we first introduce CounterLogic, a benchmark specifically designed to disentangle logical validity from knowledge alignment. Evaluation of 11 LLMs across six diverse reasoning datasets reveals a consistent failure: model accuracy plummets by an average of 14% in counterfactual scenarios compared to knowledge-aligned ones. We hypothesize that this gap stems not from a flaw in logical processing, but from an inability to manage the cognitive conflict between context and knowledge. Inspired by human metacognition, we propose a simple yet powerful intervention: Flag…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Multimodal Machine Learning Applications · Explainable Artificial Intelligence (XAI)