Adding One Neuron Can Eliminate All Bad Local Minima
Shiyu Liang, Ruoyu Sun, Jason D. Lee, R. Srikant

TL;DR
This paper demonstrates that adding a single specially designed neuron with a skip connection to neural networks can eliminate all bad local minima, ensuring that every local minimum is globally optimal in binary classification tasks.
Contribution
The authors prove that a single neuron with a skip connection can transform the loss landscape so that all local minima are global, under mild assumptions.
Findings
Adding one neuron with skip connection guarantees global optimality at all local minima
The result applies to neural networks for binary classification
The landscape becomes more favorable for training algorithms
Abstract
One of the main difficulties in analyzing neural networks is the non-convexity of the loss function which may have many bad local minima. In this paper, we study the landscape of neural networks for binary classification tasks. Under mild assumptions, we prove that after adding one special neuron with a skip connection to the output, or one special neuron per layer, every local minimum is a global minimum.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Algorithms · Stochastic Gradient Optimization Techniques · Neural Networks and Applications
