Fluctuations, Bias, Variance & Ensemble of Learners: Exact Asymptotics   for Convex Losses in High-Dimension

Bruno Loureiro; C\'edric Gerbelot; Maria Refinetti; Gabriele; Sicuro; Florent Krzakala

arXiv:2201.13383·stat.ML·October 3, 2022

Fluctuations, Bias, Variance & Ensemble of Learners: Exact Asymptotics for Convex Losses in High-Dimension

Bruno Loureiro, C\'edric Gerbelot, Maria Refinetti, Gabriele, Sicuro, Florent Krzakala

PDF

Open Access

TL;DR

This paper develops a rigorous high-dimensional asymptotic theory for fluctuations, bias, and variance in ensemble learning with convex losses, illuminating the effects of randomness and ensembling on generalization and the double-descent phenomenon.

Contribution

It provides a complete description of the joint distribution of empirical risk minimizers in high dimensions for convex losses, encompassing neural networks and kernel methods.

Findings

01

Enables analysis of ensembling effects on bias-variance decomposition.

02

Disentangles contributions of statistical fluctuations to test error.

03

Highlights the role of the interpolation threshold in double-descent.

Abstract

From the sampling of data to the initialisation of parameters, randomness is ubiquitous in modern Machine Learning practice. Understanding the statistical fluctuations engendered by the different sources of randomness in prediction is therefore key to understanding robust generalisation. In this manuscript we develop a quantitative and rigorous theory for the study of fluctuations in an ensemble of generalised linear models trained on different, but correlated, features in high-dimensions. In particular, we provide a complete description of the asymptotic joint distribution of the empirical risk minimiser for generic convex loss and regularisation in the high-dimensional limit. Our result encompasses a rich set of classification and regression tasks, such as the lazy regime of overparametrised neural networks, or equivalently the random features approximation of kernels. While allowing…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications · Stochastic Gradient Optimization Techniques · Machine Learning and ELM