Lying Is Just a Phase: The Hidden Alignment Transition in Language Model Scaling

Adil Amin

arXiv:2605.18838·cs.LG·May 20, 2026

TL;DR

This paper reports a phase transition in language models that loss curves do not show: reasoning and truthfulness shift from anticorrelated to cooperative once model size crosses a critical threshold, and that threshold itself depends on architecture and training choices.

Contribution

It identifies a regime change in how language model capabilities couple, introduces a diagnostic tool, and demonstrates how training and architecture affect this transition.

Findings

01

Coupling between reasoning and truthfulness shifts at a critical model size.

02

Data curation and architecture significantly influence the coupling phase.

03

Width normalization eliminates the anticorrelation across model families.

Abstract

Scaling laws predict loss from compute but not how capabilities interact. We measure the coupling between reasoning and truthfulness across 63 base models from 16 families and find a regime change invisible to loss curves: below a family-dependent critical scale $N_{c}$, capabilities anticorrelate; above it, they cooperate. $N_{c} \approx 3.5$B parameters [2.9B, 13.4B] (bootstrap 95% CI), but model size is not the only variable that determines phase. Architecture, data curation, and training recipe each shift $N_{c}$ independently: curated training eliminated the coupling dip between Qwen generations ($0.025 \to 0.830$ at matched scale), Gemma-4 at 4B achieves coupling 0.871, characteristic of 13B+ standard-trained models, through distillation and architectural innovation, and Phi at 1B matches web-trained coupling at 10B through data curation alone. Width normalization eliminates the…
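The abstract quotes a bootstrap 95% CI without saying how it is computed. The following is a minimal sketch of a percentile bootstrap, not the paper's procedure. It assumes, purely for illustration, that "coupling" is the Pearson correlation between per-model reasoning and truthfulness scores; the abstract does not define the coupling metric, and the scores below are synthetic placeholders, not results from the paper. The names `pearson` and `bootstrap_ci` are hypothetical.

```python
import random
from statistics import mean


def pearson(xs, ys):
    """Pearson correlation of two equal-length sequences (NaN if degenerate)."""
    mx, my = mean(xs), mean(ys)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    if sxx == 0 or syy == 0:
        return float("nan")
    return sxy / (sxx * syy) ** 0.5


def bootstrap_ci(xs, ys, stat=pearson, n_resamples=2000, alpha=0.05, seed=0):
    """Percentile bootstrap CI: resample (x, y) pairs with replacement."""
    rng = random.Random(seed)
    n = len(xs)
    estimates = []
    for _ in range(n_resamples):
        idx = [rng.randrange(n) for _ in range(n)]
        value = stat([xs[i] for i in idx], [ys[i] for i in idx])
        if value == value:  # drop NaN from degenerate resamples
            estimates.append(value)
    estimates.sort()
    lo = estimates[int((alpha / 2) * len(estimates))]
    hi = estimates[min(len(estimates) - 1, int((1 - alpha / 2) * len(estimates)))]
    return lo, hi


# Synthetic, hypothetical benchmark scores for eight models (NOT from the paper).
reasoning = [0.21, 0.25, 0.31, 0.38, 0.44, 0.52, 0.61, 0.70]
truthfulness = [0.40, 0.36, 0.35, 0.37, 0.42, 0.47, 0.55, 0.63]

point = pearson(reasoning, truthfulness)
ci_low, ci_high = bootstrap_ci(reasoning, truthfulness)
print(f"coupling = {point:.3f}, 95% CI = [{ci_low:.3f}, {ci_high:.3f}]")
```

The resampling is done over models (pairs of scores), so each resample keeps a model's reasoning and truthfulness scores together. An interval on $N_{c}$ would require replacing `stat` with whatever estimator maps a set of models to a critical scale, which the abstract does not specify.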

Code & Models

Repositories

https://zehenlabs.com/cape
github
