Detecting overfitting in Neural Networks during long-horizon grokking using Random Matrix Theory
Hari K. Prakash, Charles H Martin

TL;DR
This paper introduces a novel Random Matrix Theory method to detect overfitting in neural networks during long-horizon grokking without needing access to training or testing data.
Contribution
It presents a new spectral analysis technique that identifies Correlation Traps indicating overfitting, and distinguishes benign from harmful traps empirically.
Findings
Correlation Traps form during anti-grokking, correlating with decreasing test accuracy.
The method detects overfitting phases in large language models.
Correlation Traps are present in foundation-scale LLMs, indicating potential overfitting.
Abstract
Training Neural Networks (NNs) without overfitting is difficult; detecting that overfitting is difficult as well. We present a novel Random Matrix Theory method that detects the onset of overfitting in deep learning models without access to train or test data. For each model layer, we randomize each weight matrix element-wise, , fit the randomized empirical spectral distribution with a Marchenko-Pastur distribution, and identify large outliers that violate self-averaging. We call these outliers Correlation Traps. During the onset of overfitting, which we call the "anti-grokking" phase in long-horizon grokking, Correlation Traps form and grow in number and scale as test accuracy decreases while train accuracy remains high. Traps may be benign or may harm generalization; we provide an empirical approach to distinguish between them by passing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
