The Generative Leap: Sharp Sample Complexity for Efficiently Learning Gaussian Multi-Index Models
Alex Damian, Jason D. Lee, Joan Bruna

TL;DR
This paper introduces the generative leap exponent for Gaussian multi-index models, establishing optimal sample complexity bounds and proposing an agnostic spectral U-statistic method for efficient hidden subspace estimation.
Contribution
It extends the concept of the generative exponent to multi-index models, providing matching lower and upper bounds on sample complexity and a novel estimation procedure.
Findings
Sample complexity of $\Theta(d^{1 \vee \k/2})$ is necessary and sufficient.
The proposed spectral U-statistic estimator is agnostic and effective.
The generative leap exponent is computed for deep neural network examples.
Abstract
In this work we consider generic Gaussian Multi-index models, in which the labels only depend on the (Gaussian) -dimensional inputs through their projection onto a low-dimensional subspace, and we study efficient agnostic estimation procedures for this hidden subspace. We introduce the \emph{generative leap} exponent , a natural extension of the generative exponent from [Damian et al.'24] to the multi-index setting. We first show that a sample complexity of is necessary in the class of algorithms captured by the Low-Degree-Polynomial framework. We then establish that this sample complexity is also sufficient, by giving an agnostic sequential estimation procedure (that is, requiring no prior knowledge of the multi-index model) based on a spectral U-statistic over appropriate Hermite tensors. We further compute the generative leap…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTensor decomposition and applications · Gaussian Processes and Bayesian Inference · Generative Adversarial Networks and Image Synthesis
