TRACE for Tracking the Emergence of Semantic Representations in Transformers

Nura Aljaafari; Danilo S. Carvalho; Andr\'e Freitas

arXiv:2505.17998·cs.CL·May 26, 2025

TRACE for Tracking the Emergence of Semantic Representations in Transformers

Nura Aljaafari, Danilo S. Carvalho, Andr\'e Freitas

PDF

TL;DR

This paper introduces TRACE, a diagnostic framework that tracks the emergence of linguistic abstractions in transformer models by analyzing geometric, informational, and linguistic signals during training, revealing phase transitions linked to abstraction development.

Contribution

The paper presents TRACE, a novel method combining multiple signals and a synthetic data generator to study how linguistic structures emerge in transformers over training.

Findings

01

Phase transitions align with curvature collapse and dimension stabilization.

02

Emerging syntactic and semantic accuracy coincides with geometric shifts.

03

Abstraction patterns are consistent across different model architectures.

Abstract

Modern transformer models exhibit phase transitions during training, distinct shifts from memorisation to abstraction, but the mechanisms underlying these transitions remain poorly understood. Prior work has often focused on endpoint representations or isolated signals like curvature or mutual information, typically in symbolic or arithmetic domains, overlooking the emergence of linguistic structure. We introduce TRACE (Tracking Representation Abstraction and Compositional Emergence), a diagnostic framework combining geometric, informational, and linguistic signals to detect phase transitions in Transformer-based LMs. TRACE leverages a frame-semantic data generation method, ABSynth, that produces annotated synthetic corpora with controllable complexity, lexical distributions, and structural entropy, while being fully annotated with linguistic categories, enabling precise analysis of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.