An Analytical Formula of Population Gradient for two-layered ReLU network and its Applications in Convergence and Critical Point Analysis
Yuandong Tian

TL;DR
This paper derives an analytical formula for the population gradient of a two-layer ReLU network, enabling theoretical analysis of critical points and convergence, with implications for training dynamics and initialization strategies.
Contribution
It provides the first analytical formula for the population gradient of two-layer ReLU networks and analyzes critical points, convergence, and symmetry-breaking phenomena.
Findings
Critical points outside the teacher hyperplane form manifolds.
Convergence to the teacher parameters is probable under specific initialization.
Spontaneous symmetry-breaking occurs with multiple ReLU nodes upon small perturbations.
Abstract
In this paper, we explore theoretical properties of training a two-layered ReLU network with centered -dimensional spherical Gaussian input (=ReLU). We train our network with gradient descent on to mimic the output of a teacher network with the same architecture and fixed parameters . We show that its population gradient has an analytical formula, leading to interesting theoretical analysis of critical points and convergence behaviors. First, we prove that critical points outside the hyperplane spanned by the teacher parameters ("out-of-plane") are not isolated and form manifolds, and characterize in-plane critical-point-free regions for two ReLU case. On the other hand, convergence to for one ReLU node is guaranteed with at least …
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComplex Network Analysis Techniques · stochastic dynamics and bifurcation · Neural Networks and Applications
Methods*Communicated@Fast*How Do I Communicate to Expedia?
