Asymptotics of Learning with Deep Structured (Random) Features
Dominik Schr\"oder, Daniil Dmitriev, Hugo Cui, Bruno Loureiro

TL;DR
This paper provides a precise asymptotic analysis of test error for learning with deep structured random features in high-dimensional settings, linking it to feature covariance and neural network structures.
Contribution
It offers a novel asymptotic characterization of test error for deep structured random features, including formulas for feature covariance in complex neural networks.
Findings
Asymptotic test error characterized in terms of feature covariance.
Closed-form formula derived for feature covariance in deep networks.
Results can describe features learned by finite neural networks trained with gradient descent.
Abstract
For a large class of feature maps we provide a tight asymptotic characterisation of the test error associated with learning the readout layer, in the high-dimensional limit where the input dimension, hidden layer widths, and number of training samples are proportionally large. This characterization is formulated in terms of the population covariance of the features. Our work is partially motivated by the problem of learning with Gaussian rainbow neural networks, namely deep non-linear fully-connected networks with random but structured weights, whose row-wise covariances are further allowed to depend on the weights of previous layers. For such networks we also derive a closed-form formula for the feature covariance in terms of the weight matrices. We further find that in some cases our results can capture feature maps learned by deep, finite-width neural networks trained under gradient…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Bayesian Methods and Mixture Models · Image Processing and 3D Reconstruction
