Input-Label Correlation Governs a Linear-to-Nonlinear Transition in Random Features under Spiked Covariance
Samet Demir, Zafer Dogan

TL;DR
This paper reveals how input-label correlation and data anisotropy cause a phase transition in random feature models, determining whether they act as linear predictors or exhibit nonlinear advantages, supported by theoretical analysis and experiments.
Contribution
It introduces a universality principle and explicit boundary conditions for the linear-to-nonlinear transition in RFMs under spiked covariance data.
Findings
RFM behavior depends on input-label correlation and data anisotropy.
A phase transition boundary separates linear and nonlinear regimes.
Numerical and real-data experiments validate the theoretical predictions.
Abstract
Random feature models (RFMs), two-layer networks with a randomly initialized fixed first layer and a trained linear readout, are among the simplest nonlinear predictors. Prior asymptotic analyses in the proportional high-dimensional regime show that, under isotropic data, RFMs reduce to noisy linear models and offer no advantage over classical linear methods such as ridge regression. Yet RFMs frequently outperform linear baselines on structured real data. We show that this tension is explained by a correlation-driven phase transition: under spiked-covariance designs, the interaction between anisotropy and input-label correlation determines whether the RFM behaves as an effectively linear predictor or exhibits genuinely nonlinear gains. Concretely, we establish a universality principle under anisotropy and characterize the RFM generalization error via an equivalent noisy polynomial…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Statistical Methods and Models
