The spectrum of kernel random matrices
Noureddine El Karoui

TL;DR
This paper analyzes the spectral properties of kernel random matrices in high-dimensional settings, revealing that their behavior simplifies to linear models, challenging common heuristics in statistical and machine learning applications.
Contribution
It provides a rigorous analysis of kernel matrix spectra in high dimensions, showing a surprising linearization effect and questioning their practical modeling relevance.
Findings
Kernel matrices behave linearly in high dimensions.
Surprising simplification contrasts with heuristic expectations.
Raises questions about the applicability of random matrix models.
Abstract
We place ourselves in the setting of high-dimensional statistical inference where the number of variables in a dataset of interest is of the same order of magnitude as the number of observations . We consider the spectrum of certain kernel random matrices, in particular matrices whose th entry is or where is the dimension of the data, and are independent data vectors. Here is assumed to be a locally smooth function. The study is motivated by questions arising in statistics and computer science where these matrices are used to perform, among other things, nonlinear versions of principal component analysis. Surprisingly, we show that in high-dimensions, and for the models we analyze, the problem becomes essentially linear--which is at odds with heuristics sometimes used to justify the usage of these…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
