Are gene-by-environment interactions leveraged in multi-modality neural networks for breast cancer prediction?
Monica Isgut, Andrew Hornback, Yunan Luo, Asma Khimani, Neha Jain, May, D. Wang

TL;DR
This study investigates whether gene-by-gene and gene-by-environment interactions can improve breast cancer risk prediction using neural networks, finding significant interactions but no performance improvement over baseline models.
Contribution
First application of neural networks to identify gene and environment interactions affecting breast cancer risk using real-world data.
Findings
Identified 248 significant gene-gene and gene-environment interactions.
Most impactful interactions involved rs6001930 (MKL1) and rs889312 (MAP3K1).
Interactions with age and menopause were most prominent.
Abstract
Polygenic risk scores (PRSs) can significantly enhance breast cancer risk prediction when combined with clinical risk factor data. While many studies have explored the value-add of PRSs, little is known about the potential impact of gene-by-gene or gene-by-environment interactions towards enhancing the risk discrimination capabilities of multi-modal models combining PRSs with clinical data. In this study, we integrated data on 318 individual genotype variants along with clinical data in a neural network to explore whether gene-by-gene (i.e., between individual variants) and/or gene-by-environment (between clinical risk factors and variants) interactions could be leveraged jointly during training to improve breast cancer risk prediction performance. We benchmarked our approach against a baseline model combining traditional univariate PRSs with clinical data in a logistic regression model…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGene expression and cancer classification · AI in cancer detection · Bioinformatics and Genomic Networks
MethodsLogistic Regression
