Causal Reasoning Favors Encoders: On The Limits of Decoder-Only Models

Amartya Roy; Elamparithy M; Kripabandhu Ghosh; Ponnurangam Kumaraguru; Adrian de Wynter

arXiv:2512.10561·cs.CL·December 12, 2025

Causal Reasoning Favors Encoders: On The Limits of Decoder-Only Models

Amartya Roy, Elamparithy M, Kripabandhu Ghosh, Ponnurangam Kumaraguru, Adrian de Wynter

PDF

Open Access

TL;DR

This paper investigates the effectiveness of encoder-based models versus decoder-only models for causal reasoning, finding that encoder architectures with finetuning outperform decoder-only models in robustness and generalization for multihop reasoning tasks.

Contribution

It demonstrates that encoder and encoder-decoder architectures are better suited for causal reasoning than decoder-only models, especially when finetuned, highlighting their advantages in robustness and generalization.

Findings

01

Encoder models outperform decoder-only models in causal reasoning tasks.

02

Finetuned encoder architectures generalize better across distributional shifts.

03

Decoder-only models excel only at large scales, but are less robust.

Abstract

In context learning (ICL) underpins recent advances in large language models (LLMs), although its role and performance in causal reasoning remains unclear. Causal reasoning demands multihop composition and strict conjunctive control, and reliance on spurious lexical relations of the input could provide misleading results. We hypothesize that, due to their ability to project the input into a latent space, encoder and encoder decoder architectures are better suited for said multihop conjunctive reasoning versus decoder only models. To do this, we compare fine-tuned versions of all the aforementioned architectures with zero and few shot ICL in both natural language and non natural language scenarios. We find that ICL alone is insufficient for reliable causal reasoning, often overfocusing on irrelevant input features. In particular, decoder only models are noticeably brittle to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Bayesian Modeling and Causal Inference · Topic Modeling