Prediction in latent factor regression: Adaptive PCR and beyond

Xin Bing; Florentina Bunea; Seth Strimas-Mackey; Marten Wegkamp

arXiv:2007.10050·stat.ML·April 26, 2021·1 cites

Prediction in latent factor regression: Adaptive PCR and beyond

Xin Bing, Florentina Bunea, Seth Strimas-Mackey, Marten Wegkamp

PDF

Open Access

TL;DR

This paper provides finite sample risk bounds for principal component regression (PCR) with adaptive component selection in high-dimensional latent factor models, offering a unified theoretical framework and practical insights.

Contribution

It establishes the first finite sample risk bounds for adaptive PCR in latent factor regression, and introduces a master theorem applicable to various linear predictors.

Findings

01

Risk bounds for PCR with data-driven component selection

02

Recovery of known risk bounds for minimum-norm interpolator

03

Performance guarantees for model selection via data-splitting

Abstract

This work is devoted to the finite sample prediction risk analysis of a class of linear predictors of a response $Y \in R$ from a high-dimensional random vector $X \in R^{p}$ when $(X, Y)$ follows a latent factor regression model generated by a unobservable latent vector $Z$ of dimension less than $p$ . Our primary contribution is in establishing finite sample risk bounds for prediction with the ubiquitous Principal Component Regression (PCR) method, under the factor regression model, with the number of principal components adaptively selected from the data -- a form of theoretical guarantee that is surprisingly lacking from the PCR literature. To accomplish this, we prove a master theorem that establishes a risk bound for a large class of predictors, including the PCR predictor as a special case. This approach has the benefit of providing a unified framework for the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStatistical Methods and Inference · Stochastic Gradient Optimization Techniques · Domain Adaptation and Few-Shot Learning