Maximizing the Power of Principal Components Analysis of Correlated Phenotypes in Genome-wide Association Studies
Hugues Aschard, Bjarni J. Vilhj\'almsson, Nicolas Greliche,, Pierre-Emmanuel Morange, David-Alexandre Tr\'egou\"et, Peter Kraft

TL;DR
This paper investigates how PCA can be optimized for GWAS of correlated traits, showing that using all PCs enhances power to detect various genetic effects, including pleiotropy and trait-specific associations.
Contribution
It demonstrates that testing all principal components, rather than just the top ones, improves detection of genetic associations in correlated phenotypes.
Findings
Testing all PCs increases power for detecting genetic associations.
All PCs can reveal associations with traits explaining small variance.
Application to coagulation traits identified new candidate SNPs.
Abstract
Principal Component analysis (PCA) is a useful statistical technique that is commonly used for multivariate analysis of correlated variables. It is usually applied as a dimension reduction method: the top principal components (PCs) explaining most of total variance are tested for association with a predictor of interest, and the remaining PCs are ignored. This strategy has been widely applied in genetic epidemiology, however some of its aspects are not well appreciated in the context of single nucleotide polymorphisms (SNPs) association testing. In this study, we review the theoretical basis of PCA and its behavior when testing for association between a SNP and two correlated traits under various scenarios. We then evaluate with simulations the power of several different PCA-based strategies when analyzing up to 100 correlated traits. We show that contrary to widespread practice that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenetic Associations and Epidemiology · Bioinformatics and Genomic Networks · Gene expression and cancer classification
