High heritability does not imply accurate prediction under the small additive effects hypothesis
Arthur Frouin (1), Claire Dandine-Roulland (1), Morgane Pierre-Jean, (1), Jean-Fran\c{c}ois Deleuze (1), Christophe Ambroise (2), Edith Le Floch, (1) ((1) CNRGH, Institut Jacob, CEA - Universit\'e Paris-Saclay, (2) LaMME,, Universit\'e Paris-Saclay, CNRS

TL;DR
This paper demonstrates that high heritability estimates do not necessarily lead to accurate predictions in genomic data, especially when the number of SNPs exceeds the sample size, using a machine learning approach.
Contribution
It introduces a predictive method for estimating genomic heritability with ridge regression and derives formulas linking prediction accuracy to sample size and SNP count.
Findings
High heritability does not guarantee prediction accuracy when p>n.
Ridge regression with GCV provides consistent heritability estimates.
Prediction accuracy depends on the ratio n/p, not just heritability.
Abstract
Genome-Wide Association Studies (GWAS) explain only a small fraction of heritability for most complex human phenotypes. Genomic heritability estimates the variance explained by the SNPs on the whole genome using mixed models and accounts for the many small contributions of SNPs in the explanation of a phenotype. This paper approaches heritability from a machine learning perspective, and examines the close link between mixed models and ridge regression. Our contribution is twofold. First, we propose estimating genomic heritability using a predictive approach via ridge regression and Generalized Cross Validation (GCV). We show that this is consistent with classical mixed model based estimation. Second, we derive simple formulae that express prediction accuracy as a function of the ratio n/p, where n is the population size and p the total number of SNPs. These formulae clearly show that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenetic and phenotypic traits in livestock · Genetic Mapping and Diversity in Plants and Animals · Genetics and Plant Breeding
