On estimation of the noise variance in high-dimensional probabilistic principal component analysis
Damien Passemier (ECE), Zhaoyuan Li (DSAS), Jian-Feng Yao (DSAS)

TL;DR
This paper introduces a bias-corrected estimator for noise variance in high-dimensional probabilistic PCA, improving accuracy over traditional methods and enhancing procedures for determining the number of principal components.
Contribution
The paper develops a novel bias-corrected estimator for noise variance in high-dimensional PCA using random matrix theory, addressing a key unresolved issue.
Findings
The bias-corrected estimator outperforms existing estimators in simulations.
Replacing existing variance estimators improves PCA component determination methods.
New asymptotics for goodness-of-fit statistics are derived under high-dimensional settings.
Abstract
In this paper, we develop new statistical theory for probabilistic principal component analysis models in high dimensions. The focus is the estimation of the noise variance, which is an important and unresolved issue when the number of variables is large in comparison with the sample size. We first unveil the reasons of a widely observed downward bias of the maximum likelihood estimator of the variance when the data dimension is high. We then propose a bias-corrected estimator using random matrix theory and establish its asymptotic normality. The superiority of the new (bias-corrected) estimator over existing alternatives is first checked by Monte-Carlo experiments with various combinations of (dimension and sample size). In order to demonstrate further potential benefits from the results of the paper to general probability PCA analysis, we provide evidence of net improvements…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRandom Matrices and Applications · Bayesian Methods and Mixture Models · Advanced Combinatorial Mathematics
