# Multimodal data-driven multitask learning for enhanced identification and classification of chronic obstructive pulmonary disease: a retrospective study

**Authors:** Qian Wu, Hui Guo, Ruihan Li, Jinhuan Han, Zhen Zhang, Ayajiang Jingesi, Shuqin Kang

PMC · DOI: 10.7189/jogh.16.04028 · 2026-01-23

## TL;DR

This study develops a machine learning model that uses CT scans and clinical data to better detect and classify COPD, improving accuracy compared to traditional methods.

## Contribution

The novel contribution is a multimodal multitask learning framework that integrates CT and clinical data for COPD identification and classification.

## Key findings

- The model achieved a CCC of 0.77 for FEV1 and 0.75 for FVC with low MAE values.
- Binary COPD classification reached an AUC of 0.88 and ACC of 0.83.
- Ternary classification achieved an AUC of 0.87 and ACC of 0.79.

## Abstract

Chronic obstructive pulmonary disease (COPD), the third leading cause of death worldwide, demands prompt and precise identification and phenotyping for effective management. This study aims to develop a multimodal multi-task learning framework that concurrently performs automated detection and classification of COPD.

Retrospective multi-task model fusing computed tomography (CT) and clinical data (n = 2320) at a tertiary hospital. Predictive performance for lung-function metrics was assessed using the concordance correlation coefficient (CCC) and mean absolute error (MAE). Classification efficacy was evaluated via the area under the receiver operating characteristic curve (AUC), accuracy (ACC), precision, recall, and F1-score. Generalisability was further verified by replicating the experiments on three distinct backbone networks.

This study included 1624 patients for model training, 348 patients for the validation set, and an additional 348 patients for the independent test set. The optimal model achieved a maximum CCC of 0.75 for forced vital capacity (FVC), corresponding to an MAE of 0.37, and a maximum CCC of 0.77 for forced expiratory volume in one second (FEV1), corresponding to an MAE of 0.33. For the binary classification task (COPD/Non-COPD), the highest AUC achieved through multi-task learning was 0.88, with a maximum ACC of 0.83. In the ternary classification task (COPD/preserved ratio impaired spirometry (PRISm)/Normal), the highest AUC reached 0.87, with a maximum ACC of 0.79.

Multitask-learning models that integrate chest CT images with basic clinical variables outperform their single-task counterparts in both the identification and classification of COPD. This approach supports evidence-based clinical decision-making and advances the delivery of precision medicine.

## Linked entities

- **Diseases:** chronic obstructive pulmonary disease (MONDO:0005002), COPD (MONDO:0005002)

## Full-text entities

- **Diseases:** joint disease (MESH:D007592), impaired lung function (MESH:D003072), death (MESH:D003643), PRISm (MESH:C537758), airway obstruction (MESH:D000402), COPD (MESH:D029424), Chronic respiratory diseases (MESH:D012140), chronic (MESH:D002908), emphysema (MESH:D004646), lung disease (MESH:D008171), inflammatory (MESH:D007249), impaired spirometry (MESH:D060825), thoracic disease (MESH:D013896), CT (MESH:C000719218), lung cancer (MESH:D008175), emphysematous destruction (MESH:D041882), AI (MESH:C538142)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Figures

3 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12828439/full.md

---
Source: https://tomesphere.com/paper/PMC12828439