Interpreting artificial neural networks to detect genome-wide association signals for complex traits
Burak Yelmen, Maris Alver, Merve Nur G\"uler, Estonian Biobank, Research Team, Flora Jay, Lili Milani

TL;DR
This study employs artificial neural networks to identify genomic loci linked to complex traits, overcoming limitations of traditional GWAS by capturing nonlinear and interactive effects, and validating findings with real and simulated data.
Contribution
The paper introduces a novel neural network-based approach for detecting associated loci in GWAS, including methods for interpretability and p-value estimation, improving detection accuracy.
Findings
Neural networks can identify associated loci with high precision.
Application to schizophrenia data revealed loci consistent with known biology.
Enrichment analyses linked identified loci to brain-related functions.
Abstract
Investigating the genetic architecture of complex diseases is challenging due to the multifactorial and interactive landscape of genomic and environmental influences. Although genome-wide association studies (GWAS) have identified thousands of variants for multiple complex traits, conventional statistical approaches can be limited by simplified assumptions such as linearity and lack of epistasis in models. In this work, we trained artificial neural networks to predict complex traits using both simulated and real genotype-phenotype datasets. We extracted feature importance scores via different post hoc interpretability methods to identify potentially associated loci (PAL) for the target phenotype and devised an approach for obtaining p-values for the detected PAL. Simulations with various parameters demonstrated that associated loci can be detected with good precision using strict…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenetic Associations and Epidemiology
MethodsHigh-Order Consensuses
