A Fast, Accurate Two-Step Linear Mixed Model for Genetic Analysis Applied to Repeat MRI Measurements
Qifan Yang, Gennady V. Roshchupkin, Wiro J. Niessen, Sarah E. Medland,, Alyssa H. Zhu, Paul M. Thompson, Neda Jahanshad

TL;DR
The paper introduces a novel, non-iterative Two-Step Linear Mixed Model that significantly accelerates genetic association analysis in large biobank datasets, accurately estimates heritability, and efficiently handles repeated phenotypic measurements.
Contribution
It presents a new Two-Step LMM approach that reduces computational complexity to linear scale, enabling faster and more accurate genetic analysis of large-scale biobank data.
Findings
Accurately estimates heritability even with complex genetic relationships.
Speeds up analysis, reducing computation time from days to hours.
Improves heritability estimates for repeated MRI measurements.
Abstract
Large-scale biobanks are being collected around the world in efforts to better understand human health and risk factors for disease. They often survey hundreds of thousands of individuals, combining questionnaires with clinical, genetic, demographic, and imaging assessments; some of this data may be collected longitudinally. Genetic associations analysis of such datasets requires methods to properly handle relatedness, population structure and other types of biases introduced by confounders. Most popular and accurate approaches rely on linear mixed model (LMM) algorithms, which are iterative and computational complexity of each iteration scales by the square of the sample size, slowing the pace of discoveries (up to several days for single trait analysis), and, furthermore, limiting the use of repeat phenotypic measurements. Here, we describe our new, non-iterative, much faster and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenetic Associations and Epidemiology · Genetic and phenotypic traits in livestock · Statistical Methods and Inference
