Stability as a Liability:Systematic Breakdown of Linguistic Structure in LLMs

Xianzhe Meng; Qiangsheng Zeng; Ling Luo; Qinghan Yang; Jiarui Hao; Wenbo Wu; Qinyu Wang; Rui Yin; Lin Qi; Renzhi Lu

arXiv:2601.18588·cs.AI·February 10, 2026

Stability as a Liability:Systematic Breakdown of Linguistic Structure in LLMs

Xianzhe Meng, Qiangsheng Zeng, Ling Luo, Qinghan Yang, Jiarui Hao, Wenbo Wu, Qinyu Wang, Rui Yin, Lin Qi, Renzhi Lu

PDF

Open Access

TL;DR

This paper reveals that stabilizing training in large language models can lead to reduced diversity and systematic degeneration in generated outputs, challenging the assumption that stability correlates with quality.

Contribution

It demonstrates how stable training dynamics can cause models to focus on limited modes, reducing entropy and diversity, which is a novel insight into LLM training behavior.

Findings

01

Stable training leads to low-entropy, repetitive outputs.

02

Models concentrate probability on limited empirical modes.

03

Stability does not guarantee high generative diversity.

Abstract

Training stability is typically regarded as a prerequisite for reliable optimization in large language models. In this work, we analyze how stabilizing training dynamics affects the induced generation distribution. We show that under standard maximum likelihood training, stable parameter trajectories lead stationary solutions to approximately minimize the forward KL divergence to the empirical distribution, while implicitly reducing generative entropy. As a consequence, the learned model can concentrate probability mass on a limited subset of empirical modes, exhibiting systematic degeneration despite smooth loss convergence. We empirically validate this effect using a controlled feedback-based training framework that stabilizes internal generation statistics, observing consistent low-entropy outputs and repetitive behavior across architectures and random seeds. It indicates that…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsLanguage and cultural evolution · Natural Language Processing Techniques · Topic Modeling