TL;DR
This paper uncovers a hidden phase transition in language models where capabilities shift from anticorrelated to cooperative as model size crosses a critical threshold, influenced by architecture and training choices.
Contribution
It identifies a regime change in language models' capabilities coupling, introduces a diagnostic tool, and demonstrates how training and architecture affect this transition.
Findings
Coupling between reasoning and truthfulness shifts at a critical model size.
Data curation and architecture significantly influence the coupling phase.
Width normalization eliminates the anticorrelation across model families.
Abstract
Scaling laws predict loss from compute but not how capabilities interact. We measure the coupling between reasoning and truthfulness across 63 base models from 16 families and find a regime change invisible to loss curves: below a family-dependent critical scale , capabilities anticorrelate; above it, they cooperate. B parameters [2.9B, 13.4B] (bootstrap 95% CI), but model size is not the only variable that determines phase. Architecture, data curation, and training recipe each shift independently: curated training eliminated the coupling dip between Qwen generations ( at matched scale), Gemma-4 at 4B achieves coupling 0.871, characteristic of 13B+ standard-trained models, through distillation and architectural innovation, and Phi at 1B matches web-trained coupling at 10B through data curation alone. Width normalization eliminates the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
