Bregman Divergence-Based Data Integration with Application to Polygenic Risk Score (PRS) Heterogeneity Adjustment
Qinmengge Li, Matthew T. Patrick, Haihan Zhang, Chachrit, Khunsriraksakul, Philip E. Stuart, Johann E. Gudjonsson, Rajan Nair, James T., Elder, Dajiang J. Liu, Jian Kang, Lam C. Tsoi, Kevin He

TL;DR
This paper introduces a Bregman divergence-based method to improve polygenic risk score predictions for minority populations by effectively integrating data from different populations using only summary statistics.
Contribution
It proposes a novel Bregman divergence-based estimation procedure that enhances PRS accuracy for minority groups without requiring individual-level data.
Findings
Improves prediction accuracy for minority populations.
Demonstrates advantages in variable selection.
Shows asymptotic consistency and weak oracle property.
Abstract
Polygenic risk scores (PRS) have recently received much attention for genetics risk prediction. While successful for the Caucasian population, the PRS based on the minority population suffer from small sample sizes, high dimensionality and low signal-to-noise ratios, exacerbating already severe health disparities. Due to population heterogeneity, direct trans-ethnic prediction by utilizing the Caucasian model for the minority population also has limited performance. In addition, due to data privacy, the individual genotype data is not accessible for either the Caucasian population or the minority population. To address these challenges, we propose a Bregman divergence-based estimation procedure to measure and optimally balance the information from different populations. The proposed method only requires the use of encrypted summary statistics and improves the PRS performance for ethnic…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenetic Associations and Epidemiology · Genetic and phenotypic traits in livestock · Cancer-related molecular mechanisms research
