TRACE for Tracking the Emergence of Semantic Representations in Transformers
Nura Aljaafari, Danilo S. Carvalho, Andr\'e Freitas

TL;DR
This paper introduces TRACE, a diagnostic framework that tracks the emergence of linguistic abstractions in transformer models by analyzing geometric, informational, and linguistic signals during training, revealing phase transitions linked to abstraction development.
Contribution
The paper presents TRACE, a novel method combining multiple signals and a synthetic data generator to study how linguistic structures emerge in transformers over training.
Findings
Phase transitions align with curvature collapse and dimension stabilization.
Emerging syntactic and semantic accuracy coincides with geometric shifts.
Abstraction patterns are consistent across different model architectures.
Abstract
Modern transformer models exhibit phase transitions during training, distinct shifts from memorisation to abstraction, but the mechanisms underlying these transitions remain poorly understood. Prior work has often focused on endpoint representations or isolated signals like curvature or mutual information, typically in symbolic or arithmetic domains, overlooking the emergence of linguistic structure. We introduce TRACE (Tracking Representation Abstraction and Compositional Emergence), a diagnostic framework combining geometric, informational, and linguistic signals to detect phase transitions in Transformer-based LMs. TRACE leverages a frame-semantic data generation method, ABSynth, that produces annotated synthetic corpora with controllable complexity, lexical distributions, and structural entropy, while being fully annotated with linguistic categories, enabling precise analysis of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
