Global Low-Rank, Local Full-Rank: The Holographic Encoding of Learned Algorithms
Yongzhong Xu

TL;DR
This paper reveals that neural networks encode learned algorithms through a globally low-rank, locally full-rank holographic structure, which explains the grokking phenomenon and challenges traditional low-rank assumptions.
Contribution
It introduces the holographic encoding principle, showing that learned algorithms are stored via coordinated updates across all parameters rather than localized low-rank structures.
Findings
Grokking trajectories are confined to a 2-6 dimensional global subspace.
Individual weight matrices remain effectively full-rank during learning.
Trajectory PCA recovers over 95% of final accuracy from few principal components.
Abstract
Grokking -- the abrupt transition from memorization to generalization after extended training -- has been linked to the emergence of low-dimensional structure in learning dynamics. Yet neural network parameters inhabit extremely high-dimensional spaces. How can a low-dimensional learning process produce solutions that resist low-dimensional compression? We investigate this question in multi-task modular arithmetic, training shared-trunk Transformers with separate heads for addition, multiplication, and a quadratic operation modulo 97. Across three model scales (315K--2.2M parameters) and five weight decay settings, we compare three reconstruction methods: per-matrix SVD, joint cross-matrix SVD, and trajectory PCA. Across all conditions, grokking trajectories are confined to a 2--6 dimensional global subspace, while individual weight matrices remain effectively full-rank.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsModel Reduction and Neural Networks · Stochastic Gradient Optimization Techniques · Quantum many-body systems
