GrokAlign: Geometric Characterisation and Acceleration of Grokking
Thomas Walker, Ahmed Imtiaz Humayun, Randall Balestriero, Richard Baraniuk

TL;DR
This paper introduces GrokAlign, a Jacobian regularization method that accelerates grokking in deep networks by aligning Jacobians with training data, supported by theoretical insights and empirical results.
Contribution
It proposes Jacobian alignment as a novel regularization technique to induce grokking faster, along with a simplified centroid alignment method for analyzing training dynamics.
Findings
Jacobian alignment ensures grokking under low-rank assumptions
GrokAlign accelerates grokking compared to weight decay
Centroid alignment tracks training stages effectively
Abstract
A key challenge for the machine learning community is to understand and accelerate the training dynamics of deep networks that lead to delayed generalisation and emergent robustness to input perturbations, also known as grokking. Prior work has associated phenomena like delayed generalisation with the transition of a deep network from a linear to a feature learning regime, and emergent robustness with changes to the network's functional geometry, in particular the arrangement of the so-called linear regions in deep networks employing continuous piecewise affine nonlinearities. Here, we explain how grokking is realised in the Jacobian of a deep network and demonstrate that aligning a network's Jacobians with the training data (in the sense of cosine similarity) ensures grokking under a low-rank Jacobian assumption. Our results provide a strong theoretical motivation for the use of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Materials and Mechanics · Interactive and Immersive Displays · Architecture and Computational Design
