# External Validation of Lung Cancer Prediction Models Combining Epidemiological Predictors in Chinese Ever and Never Smokers: Guangzhou Biobank Cohort Study

**Authors:** Bo Xing Feng, Xin Yue Pan, Jing Ru Huang, Chao Qiang Jiang, Wei Sen Zhang, Feng Zhu, Jing Pan, Tai Hing Lam

PMC · DOI: 10.1002/cam4.71104 · Cancer Medicine · 2025-07-31

## TL;DR

This study tested how well various lung cancer prediction models work for Chinese smokers and non-smokers, finding that some models outperform existing screening criteria.

## Contribution

The study provides an external validation of lung cancer models in a Chinese population, comparing their performance across smokers and non-smokers.

## Key findings

- Several models like LLP version 2, LLP version 3, OWL, LCRS, PLCOall2014, and LLPi performed better in ever smokers than in never smokers.
- Risk prediction models identified more high-risk individuals than existing screening criteria like T/CPMA 013-2020, USPSTF-2021, and NELSON.
- Replacing screening criteria with risk prediction models may improve lung cancer screening efficiency.

## Abstract

This study aimed to externally validate existing lung cancer models using data from the Guangzhou Biobank Cohort Study (GBCS) and compare their predictive performance for Chinese ever and never smokers.

We evaluated the discrimination and calibration of LCRAT (Lung Cancer Risk Assessment Tool), LLP version 2 (Liverpool Lung Project version 2), LLP version 3 (Liverpool Lung Project version 3), HUNT (HUNT was derived from the Nord‐Trøndelag Health Study), OWL (Optimized Early Warning Model for Lung Cancer Risk), LCRS (Lung Cancer Risk Score), PLCOm2012 (Prostate, Lung, Colorectal, and Ovarian 2012 model), PLCOall2014 (Prostate, Lung, Colorectal, and Ovarian 2014 model), NHIS (Korean National Health Insurance Service), LLPi (Liverpool Lung Project Risk Prediction Model for Lung Cancer Incidence), Pittsburgh, and Bach models. We compared the performance of models and Chinese lung cancer screening (T/CPMA 013‐2020), US Preventive Services Task Force 2021 (USPSTF‐2021) and Nederlands–Leuvens Longkanker Screenings Onderzoek (NELSON) criteria.

The LLP version 2, LLP version 3, OWL, LCRS, PLCOall2014, and LLPi models showed better performance in ever smokers than in never smokers, with higher AUC (0.72–0.82 vs. 0.69–0.71) and E/O (expected to observed) ratios (0.57–0.79 vs. 0.60–0.69), while the LCRAT, HUNT, PLCOm2012, NHIS, Pittsburgh, and Bach models showed good performance in ever smokers, with AUC ranging from 0.70 to 0.79 and E/O ratios from 0.57 to 0.75. The T/CPMA 013‐2020, USPSTF‐2021, and NELSON criteria identified 56.52%–75.58% of high‐risk individuals at 5, 6, 6.6, 8.7, and 10 years, while the LCRAT, LLP version 2, LLP version 3, HUNT, OWL, LCRS, PLCOm2012, PLCOall2014, NHIS, LLPi, Pittsburgh, and Bach models identified 70.70%–89.72% of high‐risk individuals.

Most lung cancer risk prediction models showed good performance and identified more cases than screening criteria. Replacing screening criteria with risk prediction models may increase lung cancer screening efficiency.

## Linked entities

- **Diseases:** lung cancer (MONDO:0005138)

## Full-text entities

- **Genes:** ADA2 (adenosine deaminase 2) [NCBI Gene 51816] {aka ADGF, CECR1, IDGFL, PAN, SNEDS, VAIHS}
- **Diseases:** carcinogens (MESH:D011230), Prostate (MESH:D011472), diabetes mellitus (MESH:D003920), LCRS (MESH:D008175), emphysema (MESH:D004646), NHIS (OMIM:603663), cough (MESH:D003371), interstitial pulmonary disease (MESH:D017563), chronic bronchitis (MESH:D029481), Cancer (MESH:D009369), COPD (MESH:D029424), Colorectal, and Ovarian (MESH:D010049)
- **Chemicals:** asbestos (MESH:D001194), cadmium (MESH:D002104), radon (MESH:D011886), silica (MESH:D012822), alcohol (MESH:D000438), chromium (MESH:D002857), beryllium (MESH:D001608)
- **Species:** Homo sapiens (human, species) [taxon 9606], Nicotiana tabacum (American tobacco, species) [taxon 4097]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12311613/full.md

## Figures

3 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12311613/full.md

## References

34 references — full list in the complete paper: https://tomesphere.com/paper/PMC12311613/full.md

---
Source: https://tomesphere.com/paper/PMC12311613