The Landscape of Empirical Risk for Non-convex Losses
Song Mei, Yu Bai, Andrea Montanari

TL;DR
This paper analyzes the landscape of empirical risk in non-convex high-dimensional estimation, establishing uniform convergence of gradients and Hessians, and characterizing stationary points to understand optimization behavior.
Contribution
It provides a theoretical framework linking population and empirical risk landscapes for non-convex problems, including high-dimensional and sparse settings.
Findings
Uniform convergence of gradient and Hessian under sample size conditions
Complete landscape characterization for specific non-convex problems
Extension to high-dimensional sparse regimes with minimal sample complexity
Abstract
Most high-dimensional estimation and prediction methods propose to minimize a cost function (empirical risk) that is written as a sum of losses associated to each data point. In this paper we focus on the case of non-convex losses, which is practically important but still poorly understood. Classical empirical process theory implies uniform convergence of the empirical risk to the population risk. While uniform convergence implies consistency of the resulting M-estimator, it does not ensure that the latter can be computed efficiently. In order to capture the complexity of computing M-estimators, we propose to study the landscape of the empirical risk, namely its stationary points and their properties. We establish uniform convergence of the gradient and Hessian of the empirical risk to their population counterparts, as soon as the number of samples becomes larger than the number of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Algorithms · Sparse and Compressive Sensing Techniques · Statistical Methods and Inference
