# Integrating Polygenic Scores with Clinical, Lifestyle, and Social Risk Factors to Improve Heart Failure Risk Prediction

**Authors:** Katie M. Cardone, Dokyoon Kim, Marylyn D. Ritchie

PMC · DOI: 10.1142/9789819824755_0046 · Pacific Symposium on Biocomputing. Pacific Symposium on Biocomputing · 2026-03-02

## TL;DR

This study shows that combining genetic and non-genetic factors improves the prediction of heart failure risk, offering opportunities for early intervention.

## Contribution

The study introduces a novel approach integrating polygenic scores with clinical, lifestyle, and social risk factors for heart failure prediction.

## Key findings

- The integrated model (PGS + CRS + PXS) outperformed individual risk scores in predicting heart failure.
- Including PGS with clinical and exposure risk factors as independent features improved model performance based on AUPRC and F1 score.
- Combining multiple domains of risk factors enhances heart failure risk prediction accuracy.

## Abstract

Heart failure (HF) is highly prevalent, high-burden disorder with its prevalence expected to increase. Early detection of HF can reduce morbidity and mortality; therefore, novel early detection methods are needed. Polygenic scores (PGS) can combine common variants across the genome and provide phenotype-specific risk scores. However, there are also many well-known, non-genomic risk factors of HF, in the clinical, lifestyle, and social determinant of health (SDOH) domains, and it is not clear how genetic and non-genetic risk factors collectively contribute to HF risk. To address this question, we assessed whether combining HF PGS with clinical, lifestyle, and SDOH risk factors improves risk prediction. Leveraging data from the All of Us Research Program (n = 22,275), clinical risk factors were aggregated into a clinical risk score (CRS) while lifestyle and SDOH risk factors were aggregated into a polyexposure score (PXS). Feature selection was conducted with LASSO regression and statistical significance thresholding from logistic regression models (p < 0.05). Features were included in the model if they were statistically significant and important in ≥ 95% of 1000 iterations. To assess model performance, logistic regressions with HF case/control status were conducted with each risk score individually, as well as integrated models. The integrated model (PGS + CRS + PXS) performed better than individual risk scores (AUROC = 0.763, AUPRC = 0.047, F1 score = 0.062, balanced accuracy = 0.683). To assess the validity of the CRS and PXS, an integrated model with the PGS along with clinical and exposure risk factors as independent features was also evaluated. Based on AUPRC and F1 score, this integrated risk model (PGS + CRS risk factors + PXS risk factors) performed better than the combining the PGS with the CRS and PXS (AUROC = 0.738, AUPRC = 0.047, F1 score = 0.066, balanced accuracy = 0.657). These findings demonstrate that integration of risk factors across multiple domains can improve HF prediction. Knowing that PGS combined with clinical, lifestyle, and SDOH risk factors is predictive of HF risk provides greater opportunity for the identification of individuals at risk of HF prior to disease onset with the goal of prevention or early intervention.

## Linked entities

- **Diseases:** heart failure (MONDO:0005252)

## Full-text entities

- **Diseases:** type I and type II (MESH:D006969), bipolar disorder (MESH:D001714), cardiomyopathies (MESH:D009202), smoking (MESH:D015208), obesity (MESH:D009765), breast cancer (MESH:D001943), aortic stenosis (MESH:D001024), HF (MESH:D006333), Hypercholesterolemia (MESH:D006937), familial cardiomyopathies (MESH:C536231), T2D (MESH:D003924), hereditary disease (MESH:D030342), CRS (MESH:D000075902), atherosclerosis (MESH:D050197), smoker (MESH:C000719328), SDOH (MESH:D003643), hypertension (MESH:D006973), hyperlipidemia (MESH:D006949), schizophrenia (MESH:D012559), diabetes (MESH:D003920), AF (MESH:D001281), ischemic heart disease (MESH:D017202), CVD (MESH:D002318)
- **Chemicals:** lipid (MESH:D008055), cholesterol (MESH:D002784), Glucose (MESH:D005947), Triglycerides (MESH:D014280)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12952681/full.md

## Figures

2 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12952681/full.md

## References

51 references — full list in the complete paper: https://tomesphere.com/paper/PMC12952681/full.md

---
Source: https://tomesphere.com/paper/PMC12952681