Adding One Neuron Can Eliminate All Bad Local Minima

Shiyu Liang; Ruoyu Sun; Jason D. Lee; R. Srikant

arXiv:1805.08671·stat.ML·May 23, 2018·50 cites

Adding One Neuron Can Eliminate All Bad Local Minima

Shiyu Liang, Ruoyu Sun, Jason D. Lee, R. Srikant

PDF

Open Access

TL;DR

This paper demonstrates that adding a single specially designed neuron with a skip connection to neural networks can eliminate all bad local minima, ensuring that every local minimum is globally optimal in binary classification tasks.

Contribution

The authors prove that a single neuron with a skip connection can transform the loss landscape so that all local minima are global, under mild assumptions.

Findings

01

Adding one neuron with skip connection guarantees global optimality at all local minima

02

The result applies to neural networks for binary classification

03

The landscape becomes more favorable for training algorithms

Abstract

One of the main difficulties in analyzing neural networks is the non-convexity of the loss function which may have many bad local minima. In this paper, we study the landscape of neural networks for binary classification tasks. Under mild assumptions, we prove that after adding one special neuron with a skip connection to the output, or one special neuron per layer, every local minimum is a global minimum.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Algorithms · Stochastic Gradient Optimization Techniques · Neural Networks and Applications