Cross-trait prediction accuracy of high-dimensional ridge-type estimators in genome-wide association studies
Bingxin Zhao, Hongtu Zhu

TL;DR
This paper analyzes the prediction accuracy of ridge-type estimators in high-dimensional genome-wide association studies, revealing that out-of-sample prediction becomes robust to regularization choices as feature-to-sample ratios grow.
Contribution
It provides a unifying theoretical framework for understanding ridge estimator performance in dense high-dimensional GWAS, including the impact of regularization and correlation structures.
Findings
Out-of-sample R^2 is bounded by 1/ω and largely invariant to λ for large ω.
Optimal λ selection is less critical as p/n increases, simplifying practical applications.
In-sample R^2 depends heavily on λ and differs from out-of-sample behavior.
Abstract
Marginal association summary statistics have attracted great attention in statistical genetics, mainly because the primary results of most genome-wide association studies (GWAS) are produced by marginal screening. In this paper, we study the prediction accuracy of marginal estimator in dense (or sparsity free) high-dimensional settings with , , and . We consider a general correlation structure among the features and allow an unknown subset of them to be signals. As the marginal estimator can be viewed as a ridge estimator with regularization parameter , we further investigate a class of ridge-type estimators in a unifying framework, including the popular best linear unbiased prediction (BLUP) in genetics. We find that the influence of on out-of-sample prediction…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenetic Associations and Epidemiology · Genetic and phenotypic traits in livestock · Genetic Mapping and Diversity in Plants and Animals
