Correlation of Data Reconstruction Error and Shrinkages in Pair-wise Distances under Principal Component Analysis (PCA)
Abdulrahman Oladipupo Ibraheem

TL;DR
This paper investigates how data reconstruction errors relate to pair-wise distance shrinkages in PCA, revealing a strong correlation between eigenvalues of discarded components and distance shrinkage, but not with classification accuracy.
Contribution
It provides theoretical proofs and empirical evidence linking PCA eigenvalues, distance shrinkages, and reconstruction errors, highlighting their relationships.
Findings
Strong correlation between eigenvalues of discarded components and distance shrinkage
No significant correlation between eigenvalues or shrinkages and classification accuracy
Theoretical bounds on pair-wise distance shrinkage related to reconstruction error
Abstract
In this on-going work, I explore certain theoretical and empirical implications of data transformations under the PCA. In particular, I state and prove three theorems about PCA, which I paraphrase as follows: 1). PCA without discarding eigenvector rows is injective, but looses this injectivity when eigenvector rows are discarded 2). PCA without discarding eigen- vector rows preserves pair-wise distances, but tends to cause pair-wise distances to shrink when eigenvector rows are discarded. 3). For any pair of points, the shrinkage in pair-wise distance is bounded above by an L1 norm reconstruction error associated with the points. Clearly, 3). suggests that there might exist some correlation between shrinkages in pair-wise distances and mean square reconstruction error which is defined as the sum of those eigenvalues associated with the discarded eigenvectors. I therefore decided to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpectroscopy and Chemometric Analyses · Face and Expression Recognition · Blind Source Separation Techniques
MethodsPrincipal Components Analysis
