Inference in latent factor regression with clusterable features
Xin Bing, Florentina Bunea, Marten Wegkamp

TL;DR
This paper develops new inferential tools for estimating and making inferences about the regression coefficient in latent factor models with clusterable features, addressing longstanding challenges in variance estimation and model interpretability.
Contribution
It introduces computationally efficient estimators for the regression coefficient without prior knowledge of the number of latent factors or mixture structure, and provides a unified asymptotic distribution analysis.
Findings
Estimator is minimax-rate adaptive.
Provides a closed-form asymptotic variance expression.
Valid for high-dimensional settings with p and K varying with n.
Abstract
Regression models, in which the observed features and the response depend, jointly, on a lower dimensional, unobserved, latent vector , with , are popular in a large array of applications, and mainly used for predicting a response from correlated features. In contrast, methodology and theory for inference on the regression coefficient relating to are scarce, since typically the un-observable factor is hard to interpret. Furthermore, the determination of the asymptotic variance of an estimator of is a long-standing problem, with solutions known only in a few particular cases. To address some of these outstanding questions, we develop inferential tools for in a class of factor regression models in which the observed features are signed mixtures of the latent factors. The model specifications are practically…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Inference · Bayesian Methods and Mixture Models · Advanced Statistical Methods and Models
