# Interpreting artificial neural networks to detect genome-wide association signals for complex traits

**Authors:** Burak Yelmen, Maris Alver, Merve Nur Güler, Flora Jay, Lili Milani

PMC · DOI: 10.1093/nargab/lqag019 · NAR Genomics and Bioinformatics · 2026-02-23

## TL;DR

This paper shows how artificial neural networks can detect genetic signals for complex diseases like schizophrenia that traditional methods miss.

## Contribution

A novel approach using neural networks and post hoc interpretability to detect genome-wide association signals with estimated P-values.

## Key findings

- Neural networks detected loci not identified by linear methods in schizophrenia data.
- PAL showed significant overlap with previously known loci for schizophrenia and bipolar disorder.
- Genes in PAL were enriched for brain morphology and function-related terms.

## Abstract

Investigating the genetic architecture of complex diseases is challenging due to the multifactorial interplay of genomic and environmental influences. Although GWAS have identified thousands of variants for multiple complex traits, conventional statistical approaches can be limited by simplified assumptions such as linearity and lack of epistasis. In this work, we trained artificial neural networks using genome-wide genotype data to predict simulated and real complex traits. We extracted feature importance scores via different post hoc interpretability methods to identify potentially associated locus/loci (PAL) for the target phenotype and devised an approach for estimating P-values for the detected PAL. Simulations demonstrated that associated loci can be detected with good precision using strict selection criteria. By applying our approach to the schizophrenia cohort in the Estonian Biobank, we detected multiple loci not identified by linear methods. There was significant concordance between PAL and loci previously associated with schizophrenia and bipolar disorder, with enrichment analyses of genes within the identified PAL predominantly highlighting terms related to brain morphology and function. With advancements in model optimization and uncertainty quantification, artificial neural networks have the potential to enhance the identification of genomic loci associated with complex diseases, offering a more comprehensive approach for GWAS.

## Linked entities

- **Diseases:** schizophrenia (MONDO:0005090), bipolar disorder (MONDO:0004985)

## Full-text entities

- **Genes:** SHCBP1 (SHC binding and spindle associated 1) [NCBI Gene 79801] {aka PAL}, TOP6BL (TOP6B like initiator of meiotic double strand breaks) [NCBI Gene 79703] {aka C11orf80, HYDM4, TOPOVIBL}, NTF3 (neurotrophin 3) [NCBI Gene 4908] {aka HDNF, NGF-2, NGF2, NT-3, NT3}, C3orf62 (chromosome 3 open reading frame 62) [NCBI Gene 375341] {aka MAPS}, PC (pyruvate carboxylase) [NCBI Gene 5091] {aka PCB}, TET2 (tet methylcytosine dioxygenase 2) [NCBI Gene 54790] {aka IMD75, KIAA1546, MDS}, SYT12 (synaptotagmin 12) [NCBI Gene 91683] {aka sytXII}, USP19 (ubiquitin specific peptidase 19) [NCBI Gene 10869] {aka ZMYND9}, KLHDC8B (kelch domain containing 8B) [NCBI Gene 200942] {aka CHL}, HLA-A (major histocompatibility complex, class I, A) [NCBI Gene 3105] {aka HLAA}, CCDC71 (coiled-coil domain containing 71) [NCBI Gene 64925], PRKAR2A (protein kinase cAMP-dependent type II regulatory subunit alpha) [NCBI Gene 5576] {aka PKR2, PRKAR2}, PPA2 (inorganic pyrophosphatase 2) [NCBI Gene 27068] {aka HSPC124, SCFAI, SCFI, SID6-306}
- **Diseases:** SCZ (MESH:D012559), mental, behavioural, and neurodevelopmental disorders (MESH:D001523), ADHD (MESH:D001289), autism spectrum disorder (MESH:D000067877), MAS (MESH:D020969), major depressive disorder (MESH:D003865), bipolar II (MESH:D001714), T1D (MESH:D003922), TP (MESH:C579935), mood disturbance (MESH:D019964), depressive symptoms (MESH:D003866)
- **Chemicals:** PM (-), SM (MESH:D012493)
- **Species:** Homo sapiens (human, species) [taxon 9606]
- **Mutations:** rs6779394, rs7944999, rs7647812, rs1892928, rs7122539, rs7617480, rs11063650, rs10010325, rs2454206, rs4955418, rs7674220, rs2726528, rs1603705, rs10461139, rs12631989, rs10896135, rs2713871, rs4915842, rs4955417, rs7454792, rs12642383, rs6855629

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12964191/full.md

## Figures

4 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12964191/full.md

## References

55 references — full list in the complete paper: https://tomesphere.com/paper/PMC12964191/full.md

---
Source: https://tomesphere.com/paper/PMC12964191