HEDE: Heritability estimation in high dimensions by Ensembling Debiased Estimators
Yanke Song, Xihong Lin, Pragya Sur

TL;DR
HEDE introduces an ensemble method combining debiased estimators for accurate heritability estimation in high-dimensional genetic models, with proven consistency and superior performance demonstrated through simulations and real data application.
Contribution
The paper presents HEDE, a novel ensemble heritability estimator for high-dimensional linear models with adaptive hyperparameter tuning and theoretical guarantees.
Findings
Outperforms existing methods across various genetic architectures.
Provides consistent heritability estimates with realistic genotype distributions.
Successfully applied to UK Biobank data for height and BMI heritability.
Abstract
Estimating heritability remains a significant challenge in statistical genetics. Diverse approaches have emerged over the years that are broadly categorized as either random effects or fixed effects heritability methods. In this work, we focus on the latter. We propose HEDE, an ensemble approach to estimate heritability or the signal-to-noise ratio in high-dimensional linear models where the sample size and the dimension grow proportionally. Our method ensembles post-processed versions of the debiased lasso and debiased ridge estimators, and incorporates a data-driven strategy for hyperparameter selection that significantly boosts estimation performance. We establish rigorous consistency guarantees that hold despite adaptive tuning. Extensive simulations demonstrate our method's superiority over existing state-of-the-art methods across various signal structures and genetic…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification
