Multi-level Latent Variable Models for Coheritability Analysis in Electronic Health Records
Yinjun Zhao, Nicholas Tatonetti, Yuanjia Wang

TL;DR
This paper introduces a multi-level latent variable modeling framework for estimating heritability and coheritability in EHR data, effectively capturing complex familial and phenotypic structures.
Contribution
It presents a novel, scalable statistical approach using multi-level latent variables and GEE algorithms for joint heritability and genetic correlation estimation in diverse phenotypes.
Findings
Identified significant genetic correlations between mental health and metabolic phenotypes.
Demonstrated estimator consistency and validity through simulation studies.
Applied method to real EHR data revealing shared genetic influences.
Abstract
Electronic health records (EHRs) linked with familial relationship data offer a unique opportunity to investigate the genetic architecture of complex phenotypes at scale. However, existing heritability and coheritability estimation methods often fail to account for the intricacies of familial correlation structures, heterogeneity across phenotype types, and computational scalability. We propose a robust and flexible statistical framework for jointly estimating heritability and genetic correlation among continuous and binary phenotypes in EHR-based family studies. Our approach builds on multi-level latent variable models to decompose phenotypic covariance into interpretable genetic and environmental components, incorporating both within- and between-family variations. We derive iteration algorithms based on generalized equation estimations (GEE) for estimation. Simulation studies under…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenetic Associations and Epidemiology · Statistical Methods and Inference · Machine Learning in Healthcare
