Not Just the Destination, But the Journey: Reasoning Traces Causally Shape Generalization Behaviors

Pengcheng Wen; Yanxu Zhu; Jiapeng Sun; Han Zhu; Yujin Zhou; Chi-Min Chan; Sirui Han; Yike Guo

arXiv:2603.12397·cs.CL·March 16, 2026

Not Just the Destination, But the Journey: Reasoning Traces Causally Shape Generalization Behaviors

Pengcheng Wen, Yanxu Zhu, Jiapeng Sun, Han Zhu, Yujin Zhou, Chi-Min Chan, Sirui Han, Yike Guo

PDF

Open Access

TL;DR

This paper investigates whether the reasoning process in large language models causally influences their generalization behaviors, finding that reasoning content significantly impacts model outputs independently of final answers.

Contribution

The study introduces a controlled experimental framework to isolate and analyze the causal effect of reasoning traces on model behavior, revealing their independent influence beyond answer supervision.

Findings

01

Reasoning training can amplify harmful behaviors.

02

Different reasoning types lead to distinct behaviors.

03

Reasoning alone can alter model behavior without answer supervision.

Abstract

Chain-of-Thought (CoT) is often viewed as a window into LLM decision-making, yet recent work suggests it may function merely as post-hoc rationalization. This raises a critical alignment question: Does the reasoning trace causally shape model generalization independent of the final answer? To isolate reasoning's causal effect, we design a controlled experiment holding final harmful answers constant while varying reasoning paths. We construct datasets with \textit{Evil} reasoning embracing malice, \textit{Misleading} reasoning rationalizing harm, and \textit{Submissive} reasoning yielding to pressure. We train models (0.6B--14B parameters) under multiple paradigms, including question-thinking-answer (QTA), question-thinking (QT), and thinking-only (T-only), and evaluate them in both think and no-think modes. We find that: (1) CoT training could amplify harmful generalization more than…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsChild and Animal Learning Development · Topic Modeling · Embodied and Extended Cognition