Uncertainty of high-dimensional genetic data prediction with polygenic risk scores
Haoxuan Fu, Jiaoyang Huang, Zirui Fan, Bingxin Zhao

TL;DR
This paper investigates the uncertainty in high-dimensional polygenic risk scores (PRS), establishing their asymptotic normality without sparsity assumptions, and highlights the importance of accounting for prediction-induced variance in genetic studies.
Contribution
It develops new asymptotic normality results for PRS estimators in high-dimensional settings without sparsity constraints, using advanced random matrix theory.
Findings
Ignoring PRS uncertainty underestimates variance and overstates confidence
Asymptotic normality holds for individual and cohort-level PRS measures
Numerical validation confirms theoretical insights using UK Biobank data
Abstract
In many predictive tasks, there are a large number of true predictors with weak signals, leading to substantial uncertainties in prediction outcomes. The polygenic risk score (PRS) is an example of such a scenario, where many genetic variants are used as predictors for complex traits, each contributing only a small amount of information. Although PRS has been a standard tool in genetic predictions, its uncertainty remains largely unexplored. In this paper, we aim to establish the asymptotic normality of PRS in high-dimensional predictions without sparsity constraints. We investigate the popular marginal and ridge-type estimators in PRS applications, developing central limit theorems for both individual-level predicted values (e.g., genetically predicted human height) and cohort-level prediction accuracy measures (e.g., overall predictive -squared in the testing dataset). Our results…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEvolutionary Algorithms and Applications
