Fundamental computational limits of weak learnability in high-dimensional multi-index models
Emanuele Troiani, Yatin Dandi, Leonardo Defilippis, Lenka Zdeborov\'a, Bruno Loureiro, Florent Krzakala

TL;DR
This paper investigates the theoretical limits of efficiently learning low-dimensional structures in high-dimensional multi-index models, revealing phase transitions and hierarchical learning phenomena relevant to neural network training.
Contribution
It characterizes the sample complexity thresholds and phase transitions for weakly recovering subspaces in multi-index models, extending understanding of neural network learnability.
Findings
Identifies conditions for trivial subspace learnability with minimal samples.
Establishes a phase transition at a critical sample complexity for learning non-trivial subspaces.
Discovers hierarchical learning where directions are learned sequentially based on difficulty.
Abstract
Multi-index models - functions which only depend on the covariates through a non-linear transformation of their projection on a subspace - are a useful benchmark for investigating feature learning with neural nets. This paper examines the theoretical boundaries of efficient learnability in this hypothesis class, focusing on the minimum sample complexity required for weakly recovering their low-dimensional structure with first-order iterative algorithms, in the high-dimensional regime where the number of samples is proportional to the covariate dimension . Our findings unfold in three parts: (i) we identify under which conditions a trivial subspace can be learned with a single step of a first-order algorithm for any ; (ii) if the trivial subspace is empty, we provide necessary and sufficient conditions for the existence of an easy subspace where…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Statistical Methods and Inference
MethodsSparse Evolutionary Training
