Uniform error bound for PCA matrix denoising
Xin T. Tong, Wanjie Wang, Yuguan Wang

TL;DR
This paper establishes a uniform error bound for PCA-based matrix denoising in high-dimensional data, demonstrating its rate-optimality and impact on downstream tasks like clustering and manifold learning.
Contribution
It provides the first uniform error bound for PCA denoising under mild spectral gap conditions and shows its rate-optimality with practical implications.
Findings
PCA denoising achieves a uniform error bound of O(σ log n).
The spectral gap condition is satisfied for data with non-degenerate covariance.
Numerical results confirm the theoretical error bounds and their relevance to applications.
Abstract
Principal component analysis (PCA) is a simple and popular tool for processing high-dimensional data. We investigate its effectiveness for matrix denoising. We consider the clean data are generated from a low-dimensional subspace, but masked by independent high-dimensional sub-Gaussian noises with standard deviation . Under the low-rank assumption on the clean data with a mild spectral gap assumption, we prove that the distance between each pair of PCA-denoised data point and the clean data point is uniformly bounded by . To illustrate the spectral gap assumption, we show it can be satisfied when the clean data are independently generated with a non-degenerate covariance matrix. We then provide a general lower bound for the error of the denoised data matrix, which indicates PCA denoising gives a uniform error bound that is rate-optimal. Furthermore, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSparse and Compressive Sensing Techniques · Image and Signal Denoising Methods · Blind Source Separation Techniques
