Neuronal Fluctuations: Learning Rates vs Participating Neurons
Darsh Pareek, Umesh Kumar, Ruthu Rao, Ravi Janjam

TL;DR
This paper explores how different learning rates affect internal neural fluctuations in deep neural networks, revealing their impact on training dynamics and model performance to improve hyperparameter tuning strategies.
Contribution
It systematically analyzes the relationship between learning rates and neural parameter fluctuations, providing new insights into the optimization process in deep learning.
Findings
Higher learning rates increase fluctuation magnitude.
Fluctuation patterns correlate with final accuracy.
Optimal fluctuations balance exploration and exploitation.
Abstract
Deep Neural Networks (DNNs) rely on inherent fluctuations in their internal parameters (weights and biases) to effectively navigate the complex optimization landscape and achieve robust performance. While these fluctuations are recognized as crucial for escaping local minima and improving generalization, their precise relationship with fundamental hyperparameters remains underexplored. A significant knowledge gap exists concerning how the learning rate, a critical parameter governing the training process, directly influences the dynamics of these neural fluctuations. This study systematically investigates the impact of varying learning rates on the magnitude and character of weight and bias fluctuations within a neural network. We trained a model using distinct learning rates and analyzed the corresponding parameter fluctuations in conjunction with the network's final accuracy. Our…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Materials Science · Stochastic Gradient Optimization Techniques · Machine Learning and Data Classification
