A Novel Approach to Encode Two-Way Epistatic Interactions Between Single Nucleotide Polymorphisms
Nathaniel Gunter, Prashanthi Vemuri, Vijay Ramanan, Robel K, Gebre

TL;DR
This paper introduces a new method for encoding gene-gene interactions in genetic risk models, preserving individual SNP information while capturing interactions, which enhances interpretability and model performance.
Contribution
The paper proposes three methods to encode epistatic interactions that retain SNP information, improving upon simple interaction models and facilitating better genetic risk assessment.
Findings
Interaction-preserving methods outperform simple interaction models.
Raw SNP genotypes are sufficient for complex ML models.
Explicit interaction encoding aids interpretability of genetic pathways.
Abstract
Modelling gene-gene epistatic interactions when computing genetic risk scores is not a well-explored subfield of genetics and could have potential to improve risk stratification in practice. Though applications of machine learning (ML) show promise as an avenue of improvement for current genetic risk assesments, they frequently suffer from the problem of two many features and to little data. We propose a method that when combined with ML allows information from individual genetic contributors to be preserved while incorporating information on their interactions in a single feature. This allows second-order analysis, while simultaneously increasing the number of input features to ML models as little as possible. We presented three methods that can be utilized to account for genetic interactions. We found that interaction methods that preserved information from the constituent SNPs…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenetic Associations and Epidemiology · Bioinformatics and Genomic Networks · Genetic and phenotypic traits in livestock
