How and Why LLMs Generalize: A Fine-Grained Analysis of LLM Reasoning from Cognitive Behaviors to Low-Level Patterns

Haoyue Bai; Yiyou Sun; Wenjie Hu; Shi Qiu; Maggie Ziyu Huan; Peiyang Song; Robert Nowak; Dawn Song

arXiv:2512.24063·cs.LG·January 1, 2026

How and Why LLMs Generalize: A Fine-Grained Analysis of LLM Reasoning from Cognitive Behaviors to Low-Level Patterns

Haoyue Bai, Yiyou Sun, Wenjie Hu, Shi Qiu, Maggie Ziyu Huan, Peiyang Song, Robert Nowak, Dawn Song

PDF

Open Access

TL;DR

This paper introduces a detailed framework to analyze how large language models generalize reasoning skills during training, revealing that reinforcement learning preserves reasoning abilities better than supervised fine-tuning.

Contribution

It presents a novel benchmark decomposing reasoning into atomic skills and a meta-probing framework to analyze model behavior across training stages.

Findings

01

RL-tuned models maintain stable reasoning behaviors

02

SFT models show sharper drift and overfitting

03

The framework enables granular analysis of reasoning skill development

Abstract

Large Language Models (LLMs) display strikingly different generalization behaviors: supervised fine-tuning (SFT) often narrows capability, whereas reinforcement-learning (RL) tuning tends to preserve it. The reasons behind this divergence remain unclear, as prior studies have largely relied on coarse accuracy metrics. We address this gap by introducing a novel benchmark that decomposes reasoning into atomic core skills such as calculation, fact retrieval, simulation, enumeration, and diagnostic, providing a concrete framework for addressing the fundamental question of what constitutes reasoning in LLMs. By isolating and measuring these core skills, the benchmark offers a more granular view of how specific cognitive abilities emerge, transfer, and sometimes collapse during post-training. Combined with analyses of low-level statistical patterns such as distributional divergence and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Multimodal Machine Learning Applications · Text Readability and Simplification