Magnitude and Angle Dynamics in Training Single ReLU Neurons
Sangmin Lee, Byeongsu Sim, Jong Chul Ye

TL;DR
This paper analyzes the dynamics of training single ReLU neurons by decomposing the process into magnitude and angle components, providing bounds and insights into convergence behavior, especially under small initialization, with experimental verification.
Contribution
It offers a theoretical framework for understanding the training dynamics of deep ReLU neurons through magnitude-angle decomposition, including bounds and convergence analysis.
Findings
Small scale initialization leads to slow convergence in deep ReLU neurons.
Theoretical bounds describe the evolution of magnitude and angle during training.
Experimental results verify the theoretical predictions.
Abstract
To understand learning the dynamics of deep ReLU networks, we investigate the dynamic system of gradient flow by decomposing it to magnitude and angle components. In particular, for multi-layer single ReLU neurons with spherically symmetric data distribution and the square loss function, we provide upper and lower bounds for magnitude and angle components to describe the dynamics of gradient flow. Using the obtained bounds, we conclude that small scale initialization induces slow convergence speed for deep single ReLU neurons. Finally, by exploiting the relation of gradient flow and gradient descent, we extend our results to the gradient descent approach. All theoretical results are verified by experiments.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsModel Reduction and Neural Networks · Neural Networks and Applications · Gaussian Processes and Bayesian Inference
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
