The Underlying Correlated Dynamics in Neural Training
Rotem Turjeman, Tom Berkov, Ido Cohen, Guy Gilboa

TL;DR
This paper introduces correlation mode decomposition (CMD), a method that models neural network training dynamics by grouping highly correlated parameters, achieving significant dimensionality reduction and improved understanding of training behavior.
Contribution
The paper presents a novel correlation-based model that reduces the complexity of neural training dynamics, applicable to large networks like ResNet-18, transformers, and GANs.
Findings
CMD achieves remarkable dimensionality reduction.
Modes are spread throughout network layers.
Regularization from CMD improves test set generalization.
Abstract
Training of neural networks is a computationally intensive task. The significance of understanding and modeling the training dynamics is growing as increasingly larger networks are being trained. We propose in this work a model based on the correlation of the parameters' dynamics, which dramatically reduces the dimensionality. We refer to our algorithm as \emph{correlation mode decomposition} (CMD). It splits the parameter space into groups of parameters (modes) which behave in a highly correlated manner through the epochs. We achieve a remarkable dimensionality reduction with this approach, where networks like ResNet-18, transformers and GANs, containing millions of parameters, can be modeled well using just a few modes. We observe each typical time profile of a mode is spread throughout the network in all layers. Moreover, our model induces regularization which yields better…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Model Reduction and Neural Networks · Computational Physics and Python Applications
MethodsTest
