SAERMA: Stacked Autoencoder Rule Mining Algorithm for the Interpretation of Epistatic Interactions in GWAS for Extreme Obesity
Casimiro Aday Curbelo Monta\~nez, Paul Fergus, Carl Chalmers, Nurul, Ahamed Hassain Malim, Basma Abdulaimma, Denis Reilly, and Francesco Falciani

TL;DR
SAERMA combines deep autoencoders and rule mining to uncover complex epistatic interactions in GWAS data, improving interpretation and classification of genetic factors in extreme obesity.
Contribution
It introduces a novel method integrating stacked autoencoders with association rule mining for epistasis detection in GWAS, enhancing interpretability and predictive performance.
Findings
Achieved up to 77% AUC in classification
Identified significant SNP interactions related to obesity
Provided interpretable rules for genetic associations
Abstract
One of the most important challenges in the analysis of high-throughput genetic data is the development of efficient computational methods to identify statistically significant Single Nucleotide Polymorphisms (SNPs). Genome-wide association studies (GWAS) use single-locus analysis where each SNP is independently tested for association with phenotypes. The limitation with this approach, however, is its inability to explain genetic variation in complex diseases. Alternative approaches are required to model the intricate relationships between SNPs. Our proposed approach extends GWAS by combining deep learning stacked autoencoders (SAEs) and association rule mining (ARM) to identify epistatic interactions between SNPs. Following traditional GWAS quality control and association analysis, the most significant SNPs are selected and used in the subsequent analysis to investigate epistasis.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
