Characterization of Gaussian Universality Breakdown in High-Dimensional Empirical Risk Minimization
Chiheb Yaakoubi, Cosme Louart, Malik Tiomoko, Zhenyu Liao

TL;DR
This paper extends Gaussian universality analysis to high-dimensional empirical risk minimization with non-Gaussian data, providing an asymptotic characterization of estimators and clarifying universality limits.
Contribution
It introduces a heuristic extension of the CGMT to non-Gaussian data, deriving an asymptotic min-max characterization of ERM estimators and their Gaussian convolution approximation.
Findings
ERM estimator projections approximate a convolution of non-Gaussian and Gaussian distributions.
Any twice-differentiable regularizer is asymptotically equivalent to a quadratic form.
Numerical simulations validate the theoretical predictions.
Abstract
We study high-dimensional convex empirical risk minimization (ERM) under general non-Gaussian data designs. By heuristically extending the Convex Gaussian Min-Max Theorem (CGMT) to non-Gaussian settings, we derive an asymptotic min-max characterization of key statistics, enabling approximation of the mean and covariance of the ERM estimator . Specifically, under a concentration assumption on the data matrix and standard regularity conditions on the loss and regularizer, we show that for a test covariate independent of the training data, the projection approximately follows the convolution of the (generally non-Gaussian) distribution of with an independent centered Gaussian variable of variance . This result clarifies the scope and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
