
TL;DR
The paper introduces ILDR, a geometric metric based on neural network representations, that detects the grokking generalization transition early, enabling more efficient training and better understanding of representation space changes.
Contribution
ILDR provides a novel, early geometric detection signal for grokking, requiring no eigendecomposition and evaluated solely on held-out data, improving training efficiency.
Findings
ILDR crosses a threshold before validation accuracy improves.
ILDR leads the grokking transition by 9-73% of training steps.
Using ILDR for early stopping reduces training time by 18.6%.
Abstract
Grokking describes a delayed generalization phenomenon in which a neural network achieves perfect training accuracy long before validation accuracy improves, followed by an abrupt transition to strong generalization. Existing detection signals are indirect: weight norm reflects parameter-space regularization and consistently lags the transition, while GrokFast's slow gradient EMA, used without gradient amplification, is unstable across seeds with standard deviation exceeding mean lead time. We propose the Inter/Intra-class Distance Ratio (ILDR), a geometric metric computed on second-to-last layer representations as the ratio of inter-class centroid separation to intra-class scatter. ILDR provides an early detection signal: it rises and crosses a threshold at 2.5 times its baseline before the grokking transition appears in validation accuracy, indicating early geometric reorganization in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
