Large covariance matrix estimation with factor-assisted variable clustering
Dong Li, Xinghao Qiao, Cheng Yu

TL;DR
This paper introduces a new method for estimating high-dimensional covariance matrices in time series data by combining factor models with latent variable-specific clustering, improving interpretability and accounting for complex correlations.
Contribution
It proposes a novel factor-assisted clustering approach with a ratio-based thresholding criterion, providing theoretical guarantees and superior performance over existing methods.
Findings
Method achieves consistent cluster recovery.
Estimates converge at optimal rates under various norms.
Demonstrates improved accuracy in simulations and real data.
Abstract
This paper studies the covariance matrix estimation for high-dimensional time series within a new framework that combines low-rank factor and latent variable-specific cluster structures. The popular methods based on assuming the sparse error covariance matrix after taking out common factors may be invalid for many financial applications. Our formulation postulates a latent model-based error cluster structure after removing observable factors, which not only leads to more interpretable cluster patterns but also accounts for non-sparse cross-sectional correlations among the variable-specific residuals. Our method begins with using least-squares to estimate the factor loadings, followed by identifying the latent cluster structure by thresholding the scaled covariance difference measures of residuals. A novel ratio-based criterion is introduced to determine the threshold parameter when…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFace and Expression Recognition · Neural Networks and Applications
