Consistency of heritability estimation from summary statistics in high-dimensional linear models
David Azriel, Samuel Davenport, Armin Schwartzman

TL;DR
This paper analyzes the asymptotic properties of heritability estimators in high-dimensional GWAS, identifying conditions for their consistency and highlighting biases introduced by standardization and population stratification.
Contribution
It establishes sufficient and necessary conditions for the consistency of LDSC and GWASH heritability estimators in high-dimensional models, including effects of standardization and stratification.
Findings
Weak dependence and bounded-kurtosis effects ensure estimator consistency.
Standardization can introduce bias if conditions are violated.
Population stratification causes bias not fixed by LDSC intercept adjustment.
Abstract
In Genome-Wide Association Studies (GWAS), heritability is defined as the fraction of variance of an outcome explained by a large number of genetic predictors in a high-dimensional polygenic linear model. This work studies the asymptotic properties of the most common estimator of heritability from summary statistics called linkage disequilibrium score (LDSC) regression, together with a simpler and closely related estimator called GWAS heritability (GWASH). These estimators are analyzed in their basic versions and under various modifications used in practice including weighting and standardization. We show that, with some variations, two conditions which we call weak dependence (WD) and bounded-kurtosis effects (BKE) are sufficient for consistency of both the basic LDSC with fixed intercept and GWASH estimators, for both Gaussian and non-Gaussian predictors. For Gaussian predictors it is…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Statistical Methods and Models
