Modeling High-Dimensional Dependent Data in the Presence of Many Explanatory Variables and Weak Signals
Zhaoxing Gao, Ruey S. Tsay

TL;DR
This paper introduces a new method for modeling high-dimensional dependent data with many variables and weak signals, combining factor modeling, penalized regression, and PCA to improve estimation and inference.
Contribution
It develops a comprehensive approach that handles low signal-to-noise ratios and diverging eigenvalues, with theoretical guarantees and practical validation.
Findings
Effective in low SNR settings
Accurate estimation of common factors
Robust performance in simulations and applications
Abstract
This article considers a novel and widely applicable approach to modeling high-dimensional dependent data when a large number of explanatory variables are available and the signal-to-noise ratio is low. We postulate that a -dimensional response series is the sum of a linear regression with many observable explanatory variables and an error term driven by some latent common factors and an idiosyncratic noise. The common factors have dynamic dependence whereas the covariance matrix of the idiosyncratic noise can have diverging eigenvalues to handle the situation of low signal-to-noise ratio commonly encountered in applications. The regression coefficient matrix is estimated using penalized methods when the dimensions involved are high. We apply factor modeling to the regression residuals, employ a high-dimensional white noise testing procedure to determine the number of common factors,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications
MethodsADaptive gradient method with the OPTimal convergence rate · Linear Regression
