Topological trivialization in non-convex empirical risk minimization
Andrea Montanari, Basil Saeed

TL;DR
This paper investigates the landscape of non-convex empirical risk functions in high-dimensional settings, establishing conditions under which all local minima are essentially global, leading to a simplified optimization landscape.
Contribution
It introduces a variational formula for the expected number of local minima in non-convex models and identifies conditions for rate trivialization in high-dimensional regimes.
Findings
Expected number of local minima can be controlled asymptotically
Conditions for the vanishing of local minima growth rate are established
Numerical simulations support the conjecture of a phase transition point
Abstract
Given data , with standard -dimensional Gaussian feature vectors, and response variables, we study the general problem of learning a model parametrized by , by minimizing a loss function that depends on via the one-dimensional projections . While previous work mostly dealt with convex losses, our approach assumes general (non-convex) losses hence covering classical, yet poorly understood examples such as the perceptron and non-convex robust regression. We use the Kac-Rice formula to control the asymptotics of the expected number of local minima of the empirical risk, under the proportional asymptotics , . Specifically, we prove a finite dimensional variational formula for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Inference · Stochastic Gradient Optimization Techniques · Point processes and geometric inequalities
