Find the dimension that counts: Fast dimension estimation and Krylov PCA

Shashanka Ubaru, Abd-Krim Seghouane, and Yousef Saad

arXiv:1810.03733 · cs.NA · October 10, 2018

TL;DR

This paper introduces a fast, cost-effective method for estimating the dimension of principal subspaces in high-dimensional data, combining novel model selection with Krylov subspace techniques for efficient PCA approximation.

Contribution

The paper presents a new dimension estimation method integrated with Krylov PCA, achieving strong consistency and avoiding explicit covariance matrix computation.

Findings

01

The method is strongly consistent as the data size increases.

02

The algorithm yields near-optimal PCA results.

03

Avoids explicit covariance matrix formation, reducing computational cost.
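
As an illustration of the third finding, the product of a sample covariance matrix with a vector can be computed from the data matrix alone. The following is a minimal sketch, not the authors' code; the function name `cov_matvec` and the synthetic data are assumptions made for illustration.

```python
import numpy as np

def cov_matvec(X, v):
    """Return C @ v, where C is the sample covariance of the rows of X.

    C itself is never formed: the product is evaluated as
    (X - mean).T @ ((X - mean) @ v) / (n - 1), with the centering
    applied implicitly so that X is not copied either.
    """
    n = X.shape[0]
    mean = X.mean(axis=0)
    Xv = X @ v - mean @ v                      # (X - mean) @ v, shape (n,)
    return (X.T @ Xv - mean * Xv.sum()) / (n - 1)

# Hypothetical data: 200 samples in 50 dimensions.
rng = np.random.default_rng(0)
X = rng.standard_normal((200, 50))
v = rng.standard_normal(50)

explicit = np.cov(X, rowvar=False) @ v         # forms the 50 x 50 matrix
implicit = cov_matvec(X, v)                    # two passes over X only
max_abs_diff = float(np.max(np.abs(explicit - implicit)))
```

For n samples in p dimensions, each implicit product costs on the order of n·p operations, whereas forming the covariance matrix first costs on the order of n·p².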

Abstract

High dimensional data and systems with many degrees of freedom are often characterized by covariance matrices. In this paper, we consider the problem of simultaneously estimating the dimension of the principal (dominant) subspace of these covariance matrices and obtaining an approximation to the subspace. This problem arises in the popular principal component analysis (PCA), and in many applications of machine learning, data analysis, signal and image processing, and others. We first present a novel method for estimating the dimension of the principal subspace. We then show how this method can be coupled with a Krylov subspace method to simultaneously estimate the dimension and obtain an approximation to the subspace. The dimension estimation is achieved at no additional cost. The proposed method operates on a model selection framework, where the novel selection criterion is derived…
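
To make the coupling described in the abstract more concrete, here is a hedged sketch of a Lanczos (Krylov) iteration that touches the covariance only through matrix-vector products and returns Ritz values and Ritz vectors. The paper's selection criterion is not reproduced on this page, so the sketch substitutes a simple largest-ratio gap heuristic as a placeholder. That heuristic, the helper names (`lanczos`, `cov_matvec`), and the synthetic data are all assumptions, not the authors' method.

```python
import numpy as np

def lanczos(matvec, dim, m, rng):
    """Run up to m Lanczos steps with full reorthogonalization; return (Q, T)."""
    Q = np.zeros((dim, m))
    alpha = np.zeros(m)
    beta = np.zeros(m - 1)
    q = rng.standard_normal(dim)
    Q[:, 0] = q / np.linalg.norm(q)
    for j in range(m):
        w = matvec(Q[:, j])
        alpha[j] = Q[:, j] @ w
        # Orthogonalize against all Lanczos vectors so far (twice, for stability).
        for _ in range(2):
            w -= Q[:, :j + 1] @ (Q[:, :j + 1].T @ w)
        if j == m - 1:
            break
        beta[j] = np.linalg.norm(w)
        if beta[j] < 1e-12:                    # invariant subspace reached
            Q, alpha, beta = Q[:, :j + 1], alpha[:j + 1], beta[:j]
            break
        Q[:, j + 1] = w / beta[j]
    T = np.diag(alpha) + np.diag(beta, 1) + np.diag(beta, -1)
    return Q, T

# Hypothetical data: a 5-dimensional signal embedded in 100 dimensions plus noise.
rng = np.random.default_rng(0)
n, p, k_true = 2000, 100, 5
W, _ = np.linalg.qr(rng.standard_normal((p, k_true)))       # orthonormal basis
Z = rng.standard_normal((n, k_true)) * np.array([10.0, 8.0, 6.0, 5.0, 4.0])
X = Z @ W.T + 0.5 * rng.standard_normal((n, p))
mean = X.mean(axis=0)

def cov_matvec(v):
    """Sample covariance times v, without forming the covariance matrix."""
    Xv = X @ v - mean @ v
    return (X.T @ Xv - mean * Xv.sum()) / (n - 1)

Q, T = lanczos(cov_matvec, p, 30, rng)
theta, S = np.linalg.eigh(T)                   # Ritz values, ascending order
order = np.argsort(theta)[::-1]
ritz = theta[order]

# Placeholder criterion (NOT the paper's): cut at the largest ratio between
# consecutive Ritz values.
k_est = int(np.argmax(ritz[:-1] / ritz[1:])) + 1
U = Q @ S[:, order[:k_est]]                    # approximate principal subspace
```

The point of the sketch is only that the Ritz values needed for a dimension estimate and the Ritz vectors needed for the subspace come out of the same Krylov run, so a criterion evaluated on the Ritz values adds no further passes over the data.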

Taxonomy

Methods: Principal Components Analysis