On the Stability Properties and the Optimization Landscape of Training Problems with Squared Loss for Neural Networks and General Nonlinear Conic Approximation Schemes
Constantin Christof

TL;DR
This paper analyzes the optimization landscape of training neural networks and nonlinear conic schemes with squared loss, revealing inherent instability, saddle points, and local minima that are unaffected by regularization, especially in expressive models.
Contribution
It provides a theoretical framework showing that increased expressiveness in nonlinear conic schemes leads to instability and spurious minima, regardless of realizability or regularization.
Findings
Training problems are unstable with more expressive schemes and unrealizable labels.
Existence of saddle points and spurious local minima far from global solutions.
Regularization cannot generally eliminate these instability and local minima issues.
Abstract
We study the optimization landscape and the stability properties of training problems with squared loss for neural networks and general nonlinear conic approximation schemes. It is demonstrated that, if a nonlinear conic approximation scheme is considered that is (in an appropriately defined sense) more expressive than a classical linear approximation approach and if there exist unrealizable label vectors, then a training problem with squared loss is necessarily unstable in the sense that its solution set depends discontinuously on the label vector in the training data. We further prove that the same effects that are responsible for these instability properties are also the reason for the emergence of saddle points and spurious local minima, which may be arbitrarily far away from global solutions, and that neither the instability of the training problem nor the existence of spurious…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Numerical Analysis Techniques · Model Reduction and Neural Networks · Tribology and Lubrication Engineering
