Find the dimension that counts: Fast dimension estimation and Krylov PCA
Shashanka Ubaru, Abd-Krim Seghouane, and Yousef Saad

TL;DR
This paper introduces a fast, cost-effective method for estimating the dimension of principal subspaces in high-dimensional data, combining novel model selection with Krylov subspace techniques for efficient PCA approximation.
Contribution
The paper presents a new dimension estimation method integrated with Krylov PCA, achieving strong consistency and avoiding explicit covariance matrix computation.
Findings
Method achieves strong consistency as data size increases.
Algorithm yields near optimal PCA results.
Avoids explicit covariance matrix formation, reducing computational cost.
Abstract
High dimensional data and systems with many degrees of freedom are often characterized by covariance matrices. In this paper, we consider the problem of simultaneously estimating the dimension of the principal (dominant) subspace of these covariance matrices and obtaining an approximation to the subspace. This problem arises in the popular principal component analysis (PCA), and in many applications of machine learning, data analysis, signal and image processing, and others. We first present a novel method for estimating the dimension of the principal subspace. We then show how this method can be coupled with a Krylov subspace method to simultaneously estimate the dimension and obtain an approximation to the subspace. The dimension estimation is achieved at no additional cost. The proposed method operates on a model selection framework, where the novel selection criterion is derived…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsPrincipal Components Analysis
