The Geometric Anatomy of Capability Acquisition in Transformers
Jayadev Billa

TL;DR
This paper investigates how geometric changes in transformer representations relate to capability acquisition, revealing that geometric measures often precede behavioral improvements, especially in hard tasks.
Contribution
It introduces a detailed analysis of geometric measures across multiple transformer sizes and tasks, highlighting ankme as a reliable precursor to capability acquisition.
Findings
Representations first collapse to low-dimensional states then recover before behavior improves.
Linear probes show task-relevant information exists before the model acts on it.
Only ankme reliably predicts capability acquisition, especially in hard tasks.
Abstract
Neural networks gain capabilities during training, but the internal changes that precede capability acquisition are not well understood. In particular, the relationship between geometric change and behavioral change, and the effect of task difficulty and model scale on that relationship, is unclear. We track geometric measures and linear probes across six transformer sizes (405K--151M parameters), eight algorithmic tasks (144 tasklevelmodel combinations), and three Pythia language models (160M--2.8B). Across all settings, representations first collapse to a low-dimensional state, then recover, and only then does behavioral performance improve. Linear probes show that the model's hidden states already contain task-relevant information before the model can act on it. The collapse floor is task-specific, the collapse propagates top-down through the network, and of the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
