Deep neural network improves the estimation of polygenic risk scores for breast cancer
Adrien Badr\'e, Li Zhang, Wellington Muchero, Justin C. Reynolds,, Chongle Pan

TL;DR
This study demonstrates that a deep neural network significantly improves breast cancer polygenic risk score estimation over traditional methods, effectively identifying high-risk individuals and revealing non-linear genetic relationships.
Contribution
The paper introduces a deep neural network model that outperforms existing statistical and machine learning methods in estimating breast cancer PRS, capturing non-linear genetic effects.
Findings
DNN achieved higher AUC (67.4%) than other models.
DNN's PRS distribution was bi-modal, indicating better risk stratification.
DNN identified variants with non-linear effects not detected by association studies.
Abstract
Polygenic risk scores (PRS) estimate the genetic risk of an individual for a complex disease based on many genetic variants across the whole genome. In this study, we compared a series of computational models for estimation of breast cancer PRS. A deep neural network (DNN) was found to outperform alternative machine learning techniques and established statistical algorithms, including BLUP, BayesA and LDpred. In the test cohort with 50% prevalence, the Area Under the receiver operating characteristic Curve (AUC) were 67.4% for DNN, 64.2% for BLUP, 64.5% for BayesA, and 62.4% for LDpred. BLUP, BayesA, and LPpred all generated PRS that followed a normal distribution in the case population. However, the PRS generated by DNN in the case population followed a bi-modal distribution composed of two normal distributions with distinctly different means. This suggests that DNN was able to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
