Average Gradient Outer Product in kernel regression provably recovers the central subspace for multi-index models
Libin Zhu, Damek Davis, Dmitriy Drusvyatskiy, Maryam Fazel

TL;DR
This paper proves that the top eigenspace of the Average Gradient Outer Product (AGOP) from kernel ridge regression can reliably recover the central subspace in multi-index models, even with limited samples.
Contribution
It establishes a theoretical guarantee that AGOP-based subspace recovery is effective in low-sample regimes, explaining the efficiency of iterative kernel methods.
Findings
AGOP eigenspace recovers the central subspace under reasonable assumptions.
Subspace recovery occurs in lower sample regimes than prediction accuracy requires.
Demonstrates a separation between prediction error and representation learning.
Abstract
We study a prototypical situation when a learned predictor can discover useful low-dimensional structure in data, while using fewer samples than are needed for accurate prediction. Specifically, we consider the problem of recovering a multi-index polynomial , with and , from finitely many data/label pairs. Importantly, the target function depends on input only through the projection onto an unknown -dimensional central subspace. The algorithm we analyze is appealingly simple: fit kernel ridge regression (KRR) to the data and compute the Average Gradient Outer Product (AGOP) from the fitted predictor. Our main results show that under reasonable assumptions the top -dimensional eigenspace of AGOP provably recovers the central subspace, even in regimes when the prediction error remains large. Specifically, if the target function…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
