LSMM: A statistical approach to integrating functional annotations with genome-wide association studies
Jingsi Ming, Mingwei Dai, Mingxuan Cai, Xiang Wan, Jin Liu, Can, Yang

TL;DR
This paper introduces LSMM, a statistical model that integrates functional annotations with GWAS data to improve the identification of risk variants and provide biological insights into complex traits.
Contribution
The study presents a novel latent sparse mixed model (LSMM) with an efficient variational EM algorithm for scalable integration of functional annotations with GWAS data.
Findings
LSMM outperforms conventional methods in statistical power.
Application to real GWAS data reveals biologically relevant annotations.
LSMM enhances understanding of genetic architecture of complex traits.
Abstract
Thousands of risk variants underlying complex phenotypes (quantitative traits and diseases) have been identified in genome-wide association studies (GWAS). However, there are still two major challenges towards deepening our understanding of the genetic architectures of complex phenotypes. First, the majority of GWAS hits are in the non-coding region and their biological interpretation is still unclear. Second, accumulating evidence from GWAS suggests the polygenicity of complex traits, i.e., a complex trait is often affected by many variants with small or moderate effects, whereas a large proportion of risk variants with small effects remains unknown. The availability of functional annotation data enables us to address the above challenges. In this study, we propose a latent sparse mixed model (LSMM) to integrate functional annotations with GWAS data. Not only does it increase…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenetic Associations and Epidemiology · Gene expression and cancer classification · Genetic and phenotypic traits in livestock
