Coherent Without Grounding, Grounded Without Success: Observability and Epistemic Failure
Camilo Chac\'on Sartori

TL;DR
This paper challenges assumptions about understanding in LLMs by showing that observable success and explanation coherence can be dissociated, necessitating a new framework for evaluating artificial epistemic agents.
Contribution
It introduces the Bidirectional Coherence Paradox and the Epistemic Triangle model, highlighting the dissociation between success and explanation in LLMs across different observability domains.
Findings
LLMs can act successfully while misidentifying mechanisms in low-observability domains.
LLMs can generate explanations that are accurate but do not lead to effective intervention in high-observability domains.
Explanatory coherence alone does not indicate genuine understanding in LLMs.
Abstract
When an agent can articulate why something works, we typically take this as evidence of genuine understanding. This presupposes that effective action and correct explanation covary, and that coherent explanation reliably signals both. I argue that this assumption fails for contemporary Large Language Models (LLMs). I introduce what I call the Bidirectional Coherence Paradox: competence and grounding not only dissociate but invert across epistemic conditions. In low-observability domains, LLMs often act successfully while misidentifying the mechanisms that produce their success. In high-observability domains, they frequently generate explanations that accurately track observable causal structure yet fail to translate those diagnoses into effective intervention. In both cases, explanatory coherence remains intact, obscuring the underlying dissociation. Drawing on experiments in compiler…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
