Asymptotics of Non-Convex Generalized Linear Models in High-Dimensions: A proof of the replica formula

Matteo Vilucchio; Yatin Dandi; Mat\'eo Pirio Rossignol; Cedric Gerbelot; Florent Krzakala

arXiv:2502.20003·stat.ML·January 13, 2026

Asymptotics of Non-Convex Generalized Linear Models in High-Dimensions: A proof of the replica formula

Matteo Vilucchio, Yatin Dandi, Mat\'eo Pirio Rossignol, Cedric Gerbelot, Florent Krzakala

PDF

Open Access

TL;DR

This paper rigorously proves the replica-symmetric formulas for high-dimensional non-convex Generalized Linear Models, confirming physicists' predictions and establishing conditions for their validity using advanced mathematical tools.

Contribution

It introduces a systematic framework combining Gaussian Min-Max Theorem and Approximate Message Passing to validate replica predictions in non-convex high-dimensional optimization.

Findings

01

Proves the optimality of Tukey loss over Huber loss under contaminated data.

02

Establishes the optimality of negative regularization in non-convex regression.

03

Characterizes performance limits of linearized AMP algorithms.

Abstract

The analytic characterization of the high-dimensional behavior of optimization for Generalized Linear Models (GLMs) with Gaussian data has been a central focus in statistics and probability in recent years. While convex cases, such as the LASSO, ridge regression, and logistic regression, have been extensively studied using a variety of techniques, the non-convex case remains far less understood despite its significance. A non-rigorous statistical physics framework has provided remarkable predictions for the behavior of high-dimensional optimization problems, but rigorously establishing their validity for non-convex problems has remained a fundamental challenge. In this work, we address this challenge by developing a systematic framework that rigorously proves replica-symmetric formulas for non-convex GLMs and precisely determines the conditions under which these formulas are valid.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques · Statistical Methods and Inference

MethodsHuber loss · Adversarial Model Perturbation · Focus · ALIGN