Average Gradient Outer Product in kernel regression provably recovers the central subspace for multi-index models

Libin Zhu; Damek Davis; Dmitriy Drusvyatskiy; Maryam Fazel

arXiv:2605.15082·stat.ML·May 15, 2026

Average Gradient Outer Product in kernel regression provably recovers the central subspace for multi-index models

Libin Zhu, Damek Davis, Dmitriy Drusvyatskiy, Maryam Fazel

PDF

TL;DR

This paper proves that the top eigenspace of the Average Gradient Outer Product (AGOP) from kernel ridge regression can reliably recover the central subspace in multi-index models, even with limited samples.

Contribution

It establishes a theoretical guarantee that AGOP-based subspace recovery is effective in low-sample regimes, explaining the efficiency of iterative kernel methods.

Findings

01

AGOP eigenspace recovers the central subspace under reasonable assumptions.

02

Subspace recovery occurs in lower sample regimes than prediction accuracy requires.

03

Demonstrates a separation between prediction error and representation learning.

Abstract

We study a prototypical situation when a learned predictor can discover useful low-dimensional structure in data, while using fewer samples than are needed for accurate prediction. Specifically, we consider the problem of recovering a multi-index polynomial $f^{*} (x) = h (U x)$ , with $U \in R^{r \times d}$ and $r ≪ d$ , from finitely many data/label pairs. Importantly, the target function depends on input $x$ only through the projection onto an unknown $r$ -dimensional central subspace. The algorithm we analyze is appealingly simple: fit kernel ridge regression (KRR) to the data and compute the Average Gradient Outer Product (AGOP) from the fitted predictor. Our main results show that under reasonable assumptions the top $r$ -dimensional eigenspace of AGOP provably recovers the central subspace, even in regimes when the prediction error remains large. Specifically, if the target function…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.