Prediction in latent factor regression: Adaptive PCR and beyond
Xin Bing, Florentina Bunea, Seth Strimas-Mackey, Marten Wegkamp

TL;DR
This paper provides finite sample risk bounds for principal component regression (PCR) with adaptive component selection in high-dimensional latent factor models, offering a unified theoretical framework and practical insights.
Contribution
It establishes the first finite sample risk bounds for adaptive PCR in latent factor regression, and introduces a master theorem applicable to various linear predictors.
Findings
Risk bounds for PCR with data-driven component selection
Recovery of known risk bounds for minimum-norm interpolator
Performance guarantees for model selection via data-splitting
Abstract
This work is devoted to the finite sample prediction risk analysis of a class of linear predictors of a response from a high-dimensional random vector when follows a latent factor regression model generated by a unobservable latent vector of dimension less than . Our primary contribution is in establishing finite sample risk bounds for prediction with the ubiquitous Principal Component Regression (PCR) method, under the factor regression model, with the number of principal components adaptively selected from the data -- a form of theoretical guarantee that is surprisingly lacking from the PCR literature. To accomplish this, we prove a master theorem that establishes a risk bound for a large class of predictors, including the PCR predictor as a special case. This approach has the benefit of providing a unified framework for the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Inference · Stochastic Gradient Optimization Techniques · Domain Adaptation and Few-Shot Learning
