Estimating the construct validity of Principal Components Analysis
Thomas M.H. Hope, Cathy J. Price, Ajay Halai, Carola Salvi, Jenny, Crinion, Merel Keijsers, Christoph Sperber, Howard Bowman

TL;DR
This paper evaluates how well PCA captures true underlying sources of variance in data, highlighting its strengths, limitations, and the importance of assumptions, especially in neuropsychological applications.
Contribution
It systematically analyzes the assumptions affecting PCA's construct validity and proposes a method to better align PCA inferences with empirical data.
Findings
PCA performs well under ideal conditions but is sensitive to assumption violations.
Variance explained and replication do not reliably predict true sources.
Empirical neuropsychological data show PCA components often lack meaningful brain referents.
Abstract
In many scientific disciplines, the features of interest cannot be observed directly, so must instead be inferred from observed behaviour. Latent variable analyses are increasingly employed to systematise these inferences, and Principal Components Analysis (PCA) is perhaps the simplest and most popular of these methods. Here, we examine how the assumptions that we are prepared to entertain, about the latent variable system, mediate the likelihood that PCA-derived components will capture the true sources of variance underlying data. As expected, we find that this likelihood is excellent in the best case, and robust to empirically reasonable levels of measurement noise, but best-case performance is also: (a) not robust to violations of the method's more prominent assumptions, of linearity and orthogonality; and also (b) requires that other subtler assumptions be made, such as that the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFunctional Brain Connectivity Studies
