# Prediction of Chronic Obstructive Pulmonary Disease Using Machine Learning, Clinical Summary Notes, and Vital Signs: A Single-Center Retrospective Cohort Study in the United States

**Authors:** Sabrina Meng, Hersh Sagreiya, Negar Orangi-Fard

PMC · DOI: 10.3390/arm94010005 · Advances in Respiratory Medicine · 2026-01-07

## TL;DR

A machine learning model using clinical notes and vital signs predicted COPD exacerbations with 84% accuracy, offering potential for early intervention and better patient outcomes.

## Contribution

A novel machine learning framework integrating unstructured clinical notes and structured vital signs for COPD exacerbation prediction is proposed.

## Key findings

- A clinical note-based support vector machine model achieved an AUC of 0.81 and 84.0% accuracy in predicting COPD exacerbations.
- Combining unstructured clinical notes with structured vital signs data improved the early detection of COPD exacerbation risk.
- Patient monitor and hospital information system data provided sufficient information for accurate COPD exacerbation prediction.

## Abstract

What are the main findings?
Of the COPD exacerbation predictive models designed and assessed in our study, the clinical note-based support vector machine model achieved an AUC of 0.81 and accuracy of 84.0% in predicting COPD exacerbations.

Of the COPD exacerbation predictive models designed and assessed in our study, the clinical note-based support vector machine model achieved an AUC of 0.81 and accuracy of 84.0% in predicting COPD exacerbations.

What are the implications of the main findings?
Clinically available patient data, clinical notes, and vital signs can effectively predict COPD exacerbations, potentially enabling earlier interventions, improved outcomes, and reduced healthcare burden.Integration of unstructured clinical notes with structured vital signs data using ML frameworks may improve early detection of COPD exacerbation risk.

Clinically available patient data, clinical notes, and vital signs can effectively predict COPD exacerbations, potentially enabling earlier interventions, improved outcomes, and reduced healthcare burden.

Integration of unstructured clinical notes with structured vital signs data using ML frameworks may improve early detection of COPD exacerbation risk.

Introduction: Chronic obstructive pulmonary disease (COPD) is a leading cause of morbidity and mortality. Early identification and timely intervention for COPD exacerbations can reduce hospitalizations and complications, as well as improve patient outcomes. Methods: To develop and evaluate predictive models for COPD exacerbations using machine learning (ML), we performed a retrospective study using intensive care unit patient records. Records including 31,667 clinical notes and 10,489 vital signs were used to train and validate two machine learning models to predict COPD exacerbations in patients with known or suspected COPD. Predictive performance was evaluated for support vector machine, quadratic discriminant analysis, and adaptive boosting algorithms using area under the receiver operating characteristic curve (AUC). Results: The clinical note-based support vector machine model achieved an AUC of 0.81 and accuracy of 84.0% in predicting COPD exacerbations. Data from patient monitors and hospital information systems provided sufficient information for accurate prediction, demonstrating the utility of combining physiological signals with clinical text data. Discussion: Clinically available patient data and vital signs can effectively predict COPD exacerbations, potentially enabling earlier interventions, improved outcomes, and reduced healthcare burden. These findings suggest that integrating unstructured clinical notes with structured vital signs using ML frameworks may improve early detection of exacerbation risk, thus enabling appropriate patient counseling, triage, and treatment based on COPD severity.

## Linked entities

- **Diseases:** Chronic obstructive pulmonary disease (MONDO:0005002), COPD (MONDO:0005002)

## Full-text entities

- **Diseases:** COPD (MESH:D029424)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12821515/full.md

## Figures

5 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12821515/full.md

## References

63 references — full list in the complete paper: https://tomesphere.com/paper/PMC12821515/full.md

---
Source: https://tomesphere.com/paper/PMC12821515