Spectral Archaeology: The Causal Topology of Model Evolution
Valentin No\"el

TL;DR
This paper introduces a spectral graph analysis method to understand model evolution, revealing hidden structural changes and trade-offs in neural networks across different training stages and languages.
Contribution
It presents a training-free spectral probe that uncovers mechanistic insights and stability patterns in model behavior not visible through standard benchmarks.
Findings
Spectral fingerprints expose discontinuities missed by traditional evaluation.
PTCC indicates brittle curriculum shifts affecting connectivity.
Activation steering can partially recover lost information flow.
Abstract
Behavioral benchmarks tell us \textit{what} a model does, but not \textit{how}. We introduce a training-free mechanistic probe using attention-graph spectra. Treating each layer as a token graph, we compute algebraic connectivity (), smoothness, and spectral entropy. Across 12 models and 10 languages, these measures yield stable ``spectral fingerprints'' that expose discontinuities missed by standard evaluation. We report four results. (1) Models undergoing specific curriculum transitions (e.g., code-to-chat) show an English-only, syntax-triggered connectivity failure on non-canonical constructions, reaching . We term this scar \textit{Passive-Triggered Connectivity Collapse} (PTCC). Analysis of the Phi lineage reveals that PTCC appears and resolves across developmental stages, implicating brittle curriculum shifts rather than synthetic data…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsLanguage and cultural evolution · Ferroelectric and Negative Capacitance Devices · Face Recognition and Perception
