The Landscape of Empirical Risk for Non-convex Losses

Song Mei; Yu Bai; Andrea Montanari

arXiv:1607.06534·stat.ML·January 17, 2017·59 cites

The Landscape of Empirical Risk for Non-convex Losses

Song Mei, Yu Bai, Andrea Montanari

PDF

Open Access

TL;DR

This paper analyzes the landscape of empirical risk in non-convex high-dimensional estimation, establishing uniform convergence of gradients and Hessians, and characterizing stationary points to understand optimization behavior.

Contribution

It provides a theoretical framework linking population and empirical risk landscapes for non-convex problems, including high-dimensional and sparse settings.

Findings

01

Uniform convergence of gradient and Hessian under sample size conditions

02

Complete landscape characterization for specific non-convex problems

03

Extension to high-dimensional sparse regimes with minimal sample complexity

Abstract

Most high-dimensional estimation and prediction methods propose to minimize a cost function (empirical risk) that is written as a sum of losses associated to each data point. In this paper we focus on the case of non-convex losses, which is practically important but still poorly understood. Classical empirical process theory implies uniform convergence of the empirical risk to the population risk. While uniform convergence implies consistency of the resulting M-estimator, it does not ensure that the latter can be computed efficiently. In order to capture the complexity of computing M-estimators, we propose to study the landscape of the empirical risk, namely its stationary points and their properties. We establish uniform convergence of the gradient and Hessian of the empirical risk to their population counterparts, as soon as the number of samples becomes larger than the number of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Algorithms · Sparse and Compressive Sensing Techniques · Statistical Methods and Inference