# Multi-level phenotypic models of cardiovascular disease and obstructive sleep apnea comorbidities: A longitudinal Wisconsin sleep cohort study

**Authors:** Duy Nguyen, Ca Hoang, Tien Truong, Dang Nguyen, Hillary Gia Lam, Abhay Sharma, Trung Quoc Le, Phat Kim Huynh, Amir Hossein Behnoush, Amir Hossein Behnoush, Amir Hossein Behnoush

PMC · DOI: 10.1371/journal.pone.0327977 · PLOS One · 2025-07-15

## TL;DR

This study introduces a new model to predict how cardiovascular disease and sleep apnea interact over time, using data from a long-term sleep cohort.

## Contribution

A novel multi-level phenotypic model is proposed to dynamically predict CVD progression in OSA patients.

## Key findings

- Total cholesterol, LDL, and diabetes were top predictors of CVD outcomes.
- LGMM models achieved high diagnostic accuracy (0.9556) in tracking individual disease trajectories.
- Two distinct patient clusters were identified, with one showing higher risk for major adverse cardiovascular events.

## Abstract

Cardiovascular diseases (CVDs) are prevalent among obstructive sleep apnea (OSA) patients, presenting significant challenges in predictive modeling due to the complex interplay of these comorbidities. Current methodologies predominantly lack the dynamic and longitudinal perspective necessary to accurately predict CVD progression in the presence of OSA. This study addresses these limitations by proposing a novel multi-level phenotypic model that analyzes the progression and interaction of these comorbidities over time. Our study utilizes a longitudinal cohort from the Wisconsin sleep cohort, consisting of 1,123 participants, tracked over several decades. The methodology consists of three advanced steps to capture the relationships between these comorbid conditions: (1) performing feature importance analysis using tree-based models to highlight the predominant role of variables in predicting CVD outcomes. (2) developing a logistic mixed-effects model (LGMM) to identify longitudinal transitions and their significant factors, enabling detailed tracking of individual trajectories; (3) and utilizing t-distributed stochastic neighbor embedding (t-SNE) combined with Gaussian mixture models (GMM) to classify patient data into distinct phenotypic clusters. In the analysis of feature importance, clinical indicators such as total cholesterol, low-density lipoprotein, and diabetes emerged as the top predictors, highlighting their significant roles in CVD onset and progression. The LGMM predictive models exhibited a high diagnostic accuracy with an aggregate accuracy of 0.9556. The phenotypic analysis yielded two distinct clusters, each corresponding to unique risk profiles and disease progression pathways. One cluster notably carried a higher risk for major adverse cardiovascular events (MACEs), attributed to key factors like nocturnal hypoxia and sympathetic activation. Analysis using t-SNE and GMM confirmed these phenotypes, which marked differences in progression rates between the clusters. In conclusion, our study provides a profound understanding of the dynamic OSA-CVD interactions, offering robust tools for predicting CVD onset and informing personalized treatment strategies.

## Linked entities

- **Diseases:** obstructive sleep apnea (MONDO:0007147), diabetes (MONDO:0005015)

## Full-text entities

- **Diseases:** hypoxia (MESH:D000860), OSA (MESH:D020181), CVDs (MESH:D002318), diabetes (MESH:D003920)
- **Chemicals:** cholesterol (MESH:D002784)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12262892/full.md

## Figures

6 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12262892/full.md

## References

85 references — full list in the complete paper: https://tomesphere.com/paper/PMC12262892/full.md

---
Source: https://tomesphere.com/paper/PMC12262892