Asymptotics of Non-Convex Generalized Linear Models in High-Dimensions: A proof of the replica formula
Matteo Vilucchio, Yatin Dandi, Mat\'eo Pirio Rossignol, Cedric Gerbelot, Florent Krzakala

TL;DR
This paper rigorously proves the replica-symmetric formulas for high-dimensional non-convex Generalized Linear Models, confirming physicists' predictions and establishing conditions for their validity using advanced mathematical tools.
Contribution
It introduces a systematic framework combining Gaussian Min-Max Theorem and Approximate Message Passing to validate replica predictions in non-convex high-dimensional optimization.
Findings
Proves the optimality of Tukey loss over Huber loss under contaminated data.
Establishes the optimality of negative regularization in non-convex regression.
Characterizes performance limits of linearized AMP algorithms.
Abstract
The analytic characterization of the high-dimensional behavior of optimization for Generalized Linear Models (GLMs) with Gaussian data has been a central focus in statistics and probability in recent years. While convex cases, such as the LASSO, ridge regression, and logistic regression, have been extensively studied using a variety of techniques, the non-convex case remains far less understood despite its significance. A non-rigorous statistical physics framework has provided remarkable predictions for the behavior of high-dimensional optimization problems, but rigorously establishing their validity for non-convex problems has remained a fundamental challenge. In this work, we address this challenge by developing a systematic framework that rigorously proves replica-symmetric formulas for non-convex GLMs and precisely determines the conditions under which these formulas are valid.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques · Statistical Methods and Inference
MethodsHuber loss · Adversarial Model Perturbation · Focus · ALIGN
