On the explanatory power of principal components
Daniel A. Diaz-Pachon, J. Sunil Rao, Jean-Eudes Dazard

TL;DR
This paper investigates the probabilistic behavior of vectors relative to orthogonal bases in high-dimensional spaces and discusses implications for Principal Components Analysis in regression and learning contexts.
Contribution
It provides a probabilistic analysis of vector proximity in high-dimensional orthogonal bases and explores its implications for PCA's explanatory power.
Findings
Probability of a vector being closer to all basis vectors than other vectors approaches 1/2 as dimension increases.
Distribution of the vector's proximity converges to a normal distribution on [-1,1] with increasing dimension.
Results have significant implications for PCA in regression and learning settings.
Abstract
We show that if we have an orthogonal base () in a -dimensional vector space, and select vectors and such that the vectors traverse the origin, then the probability of being to closer to all the vectors in the base than to is at least 1/2 and converges as increases to infinity to a normal distribution on the interval [-1,1]; i.e., . This result has relevant consequences for Principal Components Analysis in the context of regression and other learning settings, if we take the orthogonal base as the direction of the principal components.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Statistical Methods and Models · Control Systems and Identification · Statistical Methods and Inference
