Association of automated quantified emphysema and interstitial lung abnormality with survival in non-small cell lung cancer
Guangjing Weng, Junli Tao, Yu Pu, Changyu Liang, Bohui Chen, Zhenyu Wang, Chengzhan Qi, Jiuquan Zhang

TL;DR
This study shows that AI-measured lung abnormalities are linked to survival rates in non-small cell lung cancer patients.
Contribution
AI quantification of emphysema and ILA improves survival prediction beyond traditional staging in NSCLC.
Findings
Whole emphysema and ILA severity are independent prognostic factors for NSCLC survival.
AI-quantified emphysema and ILA outperform TNM staging in predicting patient outcomes.
Regional emphysema and ILA severity do not show significant prognostic value.
Abstract
To investigate the prognostic value of artificial intelligence (AI) quantified emphysema and interstitial lung abnormality (ILA) in patients with non-small cell lung cancer (NSCLC). This retrospective study used AI to quantify emphysema and ILA in patients diagnosed with NSCLC between January 2015 and December 2017. Associations between AI-quantified emphysema and ILA severity and overall survival (OS) were evaluated using Cox proportional hazards models. The ability of AI-quantified emphysema and ILA severity to predict OS was explored via concordance index (C-index) and area under the time-dependent receiver operating characteristic curve (AUC). Furthermore, exploratory OS analyses were performed on subgroups stratified by chronic obstructive pulmonary disease status, treatment type, and tumor-node-metastasis (TNM) staging. Of 1675 patients, 830 (49.6%) survived, and 845 (50.4%)…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5
Figure 6
Figure 7- —the Project supported by graduate research and innovation foundation of Chongqing, China
- —Chongqing Key Medical Research Project of Combination of Science and Medicine
- —the Natural Science Foundation of Chongqing municipality
- —the Chongqing Education Commission science and technology research project
- —the Chongqing Medical Research Project of Combination of Science and Medicine
- —http://dx.doi.org/10.13039/501100001809National Natural Science Foundation of China
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsInterstitial Lung Diseases and Idiopathic Pulmonary Fibrosis · Lung Cancer Diagnosis and Treatment · Radiomics and Machine Learning in Medical Imaging
Introduction
Approximately 85% of lung cancer cases are non-small cell lung cancer (NSCLC), with a 5-year overall survival (OS) rate of only 25% [1]. The prognostic stratification of NSCLC primarily relies on tumor-node-metastasis (TNM) staging [2]. However, TNM staging is not an optimal prognostic tool for NSCLC due to the clinical heterogeneity of patients at the same stage [3]. Therefore, it is important to investigate additional indicators to improve NSCLC prognosis stratification.
Comorbidities are associated with NSCLC survival, with chronic obstructive pulmonary disease (COPD) representing the most common lung cancer comorbidity [4]. A recent study reported that COPD was correlated with worse OS rate in NSCLC patients [5]. Pulmonary function test (PFT) is the gold standard for diagnosing COPD. However, 10–95% of patients with lung function impairment have normal PFT, suggesting that PFT potentially leads to the underdiagnosis of COPD [6, 7]. Additionally, PFT depends on patient cooperation, and some lung cancer patients cannot tolerate the test [8]. Thus, it is necessary to develop an accurate and acceptable approach for providing information complementary to PFT.
Emphysema is recognized as the typical phenotype of COPD [9]. The latest Global Initiative for Chronic Obstructive Lung Disease report indicated that interstitial lung abnormality (ILA) is closely associated with COPD [10]. So, emphysema and ILA may be potential markers of COPD. Chest CT is regarded as the optimal non-invasive method for assessing emphysema and ILA [11, 12]. Previous studies have shown that NSCLC without emphysema and ILA per CT visual assessment had a favorable outcome [13, 14]. Furthermore, more severe emphysema and ILA are associated with worse prognoses [15, 16]. The outcome of NSCLC patients with severe regional emphysema has also been found to be poorer than that of patients with mild and moderate [17]. However, visual subjective evaluation is time-consuming, has poor reproducibility and requires dedicated training [12]. As artificial intelligence (AI) advances in medical imaging, AI-quantified emphysema and ILA are being explored to overcome these limitations [18, 19].
To our knowledge, this is the first study to evaluate the prognostic value of AI-quantified emphysema and ILA in NSCLC survival. The purpose of this study was to assess whether AI-quantified emphysema and ILA are associated with NSCLC prognosis. Furthermore, we investigated whether AI-quantified emphysema and ILA severity improve OS prediction, independent of TNM staging alone. Additionally, exploratory OS analyses were conducted in subgroups stratified by COPD status, treatment type, and TNM staging.
Materials and methods
This study was approved by the institutional review boards of our hospital and waived the requirement for written informed consent (CZLS2022109-A).
Study sample and follow-up
We collected baseline data from patients diagnosed with NSCLC between January 2015 and December 2017. The data consisted of demographics (age and sex), clinical data (smoking history, family history, hypertension, diabetes mellitus, coronary artery calcification and treatment type), serum tumor markers (carcinoembryonic antigen and carbohydrate antigen 125), pathological data (TNM staging and histologic subtype), and radiological data (tumor size and location were documented according to the baseline radiological report). We also analyzed PFT parameters measured within 2 weeks of the chest CT, including forced expiratory volume in the first second (FEV₁), forced vital capacity (FVC), diffusing capacity of the lung for carbon monoxide (DLCO), and the FEV₁/FVC ratio. A postbronchodilator FEV1/FVC < 0.70 was used to confirm the diagnosis of COPD [20].
The inclusion criteria were as follows: (a) histopathologically confirmed NSCLC; (b) baseline chest thin-section CT images (≤ 1.25 mm) available on our Picture Archiving and Communication System; (c) complete baseline data; and (d) no previous clinical suspicion of interstitial lung disease. The exclusion criteria were (a) presence of other primary malignancies; (b) poor image quality that was inadequate for evaluation; and (c) missing follow-up or inability to obtain follow-up outcome. This study’s outcome was the 5-year OS rate, defined as the time from diagnosis to the date of death from any cause or the date of the last follow-up.
Chest CT acquisition
Chest CT scans were performed utilizing the 64-multidetector Philips Brilliance CT. The Supplementary Materials provide a detailed chest CT scan and settings parameters.
Emphysema and ILA visual assessment
Visual assessment of baseline chest CT images was used to determine the presence of whole emphysema and ILA and their degrees of severity. Visual assessment criteria are detailed in the Supplementary Materials.
AI-quantified emphysema and ILA
AI quantification software [19, 21, 22] (A-VIEW, version 1.1.46.11, Suhai Alderi Information Technology Ltd.) was used to analyze emphysema and ILA in baseline chest CT images. Firstly, we uploaded the CT images into the software. Then, the AI automatically quantified the ratio of whole emphysema and further segmented the lobar (right upper, right middle, right lower, left upper, and left lower) to quantify the regional emphysema ratio. Regarding ILA analysis, the lung parenchyma was segmented into six zones according to anatomical landmarks (levels of the inferior aortic arch and right inferior pulmonary vein), and the AI automatically quantified the ILA ratios in these zones [11]. We obtained the following parameters: whole emphysema ratio, regional emphysema ratio, and ILA ratio in each lung zone.
The participants were divided into three categories according to the whole emphysema ratio: (1) non-emphysema; (2) mild (less than 5%); and (3) more than mild (more than 5%) [23, 24]. Furthermore, the patients were categorized according to the regional emphysema ratio into the no, low (emphysema ratio < 1% in the lobar with tumor) and high (emphysema ratio ≥ 1% in the lobar with tumor) groups [25]. Regarding ILA severity, the participants were classified into three categories: (1) non-ILA; (2) equivocal ILA (the presence of any nondependent changes affecting < 5% of any lung zone or with unilateral changes); and (3) definite ILA (the presence of any nondependent changes affecting ≥ 5% of any lung zone) [11, 26, 27]. As the regional ILA severity criteria have not been confirmed, the optimal cutoff value of regional ILA severity was determined using X-tile software (version 3.6.1; Yale University School of Medicine) as previously described [3, 28–30]. The patients were subsequently divided into three categories according to regional ILA ratio: no, low (ILA ratio < 1.2% in the lung zone with tumor), and high (ILA ratio ≥ 1.2% in the lung zone with tumor).
The schematic diagrams of AI-quantified emphysema and ILA were presented in Figs. 1 and 2, respectively.Fig. 1. Example outputs for artificial intelligence (AI) quantified emphysema. A threshold of −950 HU was used to define emphysema. The AI-quantified outputs label the CT images to indicate the severity of emphysema, and a quantification table of emphysema is also provided. Axial CT images in participants with non-emphysema (a–c), mild emphysema (d–f), and more than mild emphysema (g–i). Individuals with mild emphysema and more than mild emphysema exhibit increased CT emphysema voxels (green areas) versus those without emphysemaFig. 2Example outputs for artificial intelligence (AI) quantified interstitial lung abnormality (ILA). For ILA measurement, the lung parenchyma was segmented into six lung zones, demarcated by the upper, middle, and lower lung zones, which are defined by the levels of the inferior aortic arch and the right inferior pulmonary vein. The AI-quantified outputs label the CT images to indicate ILA severity and provide a quantification schematic diagram of ILA. Coronal CT images in participants with non-ILA (a–c), equivocal ILA (d–f), and definite ILA (g–i). Equivocal ILA and definite ILA exhibit increased green and yellow areas versus those observed without ILA
Survival analysis
Kaplan–Meier curves were produced and compared using the log-rank test. Univariable and multivariable Cox regression analysis were used to estimate hazard ratio (HR) and corresponding 95% confidence intervals (CIs). The multiplicative interaction between emphysema and ILA was determined using a likelihood ratio test to compare the Cox models with and without the emphysema and ILA of the interaction term. In addition, additive interactions were assessed using the relative excess risk due to interaction and the attributable proportion of the interaction that was obtained from the Cox model with the interaction term.
Sensitivity analysis
To evaluate the reliability of the optimal threshold for regional ILA severity determined by X-tile, three sets of sensitivity analyses were undertaken: (1) we reanalyzed the association between regional ILA severity and 5-year OS by excluding participants whose death occurred within the first years of follow-up in order to minimize reverse causality; (2) we considered that the OS of NSCLC differed according to age, sex, smoking history and tumor size, and therefore stratified participants according to age (< 65 years, ≥ 65 years), sex, smoking history and tumor size (< 3 cm, ≥ 3 cm); and (3) we used the multiple imputations were used for the imputation of covariates to assess whether the excluded patients with missing covariate values would influence our results [31].
Added value of AI-quantified emphysema and ILA over TNM staging alone
We constructed 7 prognostic predictive models per the multivariable analysis: Model 1: TNM staging; Model 2: whole emphysema severity; Model 3: ILA severity; Model 4: TNM staging and whole emphysema severity; Model 5: TNM staging and ILA severity; Model 6: whole emphysema severity and ILA severity; Model 7: TNM staging, whole emphysema severity and ILA severity.
Time-dependent receiver operating characteristic (ROC) curves were used to evaluate the performance of the models. The DeLong test was applied to compare the areas under the ROC curves (AUC) for 5-year OS between the best-performing model and the others. The concordance index (C-index) and Akaike Information Criterion (AIC) assessed model goodness-of-fit, while calibration curves with the Hosmer-Lemeshow test were applied to evaluate the agreement between predicted and observed risks and to assess model robustness. Clinical efficacy was evaluated by decision curve analysis. Shapley additive explanations plots were utilized to visualize the contribution of Model 7 selected factors to OS.
Relationship between AI-quantified metrics and surgical resection by stages I–IIIA
We explored the AI-quantified emphysema and ILA’s potential impact on eligibility for surgery in patients with resectable NSCLC (stages I–IIIA). Logistic regression analysis was fitted for the odds ratio (OR) of surgical resection by AI-quantified emphysema and ILA. We distinguish between stage IA and stages IB–IIIA because there is no indication for adjuvant treatment after surgery for stage IA, while adjuvant therapy is considered for stages IB–IIIA [5].
Subgroup analysis
Participants were stratified by: (1) COPD status (non-COPD and COPD groups); (2) treatment type (non-surgery and surgery groups); and (3) TNM staging criteria (stages I–IIIA and stages IIIB–IV groups) [13, 32, 33]. The Supplementary Materials provide detailed subgroup analysis.
Statistical analysis
Details of the variables analysis, the agreement and correlation analysis between the observer and AI were provided in the Supplementary Materials. The visual analysis of emphysema severity was classified into 5 categories by the Fleischner Society. However, due to the limited number of patients with trace, confluent and advanced destructive emphysema, we divided the patients with emphysema into 2 categories (mild and more than mild) to maintain sufficient statistical power [34]. All statistical analyses were performed by using SPSS Statistics (version 27.0; IBM Corp.) and R software (version 4.0.0; R Foundation for Statistical Computing). All p-values were two-sided, and p < 0.05 was considered to indicate significance.
Results
Participant characteristics
The original cohort comprised 1991 participants, excluding 316 participants, 1675 were ultimately included (Fig. 3). Of the 1675 patients, 830 (49.6%) survived and 845 (50.4%) died. The median age (interquartile range) of the survival group was 57 (50–64) years, and 49.2% (n = 480) of the patients were female. Among the participants who died, 513 were men (60.7%) and 332 were women (39.3%); the median age was 62 (54–68) years. The baseline characteristics are shown in Table 1.Fig. 3. Flowchart showing study sample selection and inclusion. COPD, chronic obstructive pulmonary disease; TNM, tumor-node-metastasisTable 1Baseline characteristics on the basis of mortalityParameterSurvival (n = 830)Death (n = 845)p-valueAge (years)56.02 ± 11.4660.08 ± 9.71< 0.001Sex< 0.001 Male422 (50.8)513 (60.7) Female408 (49.2)332 (39.3)Smoking history< 0.001 No519 (62.5)446 (52.8) Yes311 (37.5)399 (47.2)Family history0.01 No794 (95.7)784 (92.8) Yes36 (4.3)61 (7.2)Hypertension0.01 No725 (87.3)698 (82.6) Yes105 (12.7)147 (17.4)Diabetes mellitus0.78 No747 (90.0)764 (90.4) Yes83 (10.0)81 (9.6)Coronary artery calcification0.07 No766 (92.3)758 (89.7) Yes64 (7.7)87 (10.3)CEA (ng/mL)< 0.001 Normal (≤ 5)701 (84.5)533 (63.1) Increased (> 5)129 (15.5)312 (36.9)CA125 (U/mL)< 0.001 Normal (≤15)663 (79.9)500 (59.2) Increased (> 15)167 (20.1)345 (40.8)Tumor size (cm)< 0.001 ≤ 3505 (60.8)305 (36.1) > 3325 (39.2)540 (63.9)Tumor location< 0.001 Peripheral682 (82.2)548 (65.1) Central148 (17.8)294 (34.9)TNM staging< 0.001 I476 (57.3)62 (7.3) II183 (22.0)37 (4.4) III75 (9.0)265 (31.4) IV96 (11.6)481 (56.9)Treatment< 0.001 Non-surgery324 (39.0)608 (72.0) Surgery506 (61.0)237 (28.0)Histologic type< 0.001 Adenocarcinoma668 (80.5)547 (64.7) Squamous cell carcinoma139 (16.7)268 (31.7) Other NSCLC23 (2.8)30 (3.6)Pulmonary function test FVC (L)3.09 ± 1.802.70 ± 0.63< 0.001 FEV1 (L)2.38 ± 0.532.01 ± 0.51< 0.001 FEV1/FVC (%)78.63 ± 9.5473.22 ± 10.12< 0.001 DLCO (L)18.76 ± 3.6517.03 ± 3.14< 0.001COPD status< 0.001 Non-COPD703 (84.7)470 (55.6) COPD127 (15.3)375 (44.4)Whole emphysema severity< 0.001 Non-emphysema614 (74.0)178 (21.1) Mild155 (18.7)272 (32.2) More than mild61 (7.3)395 (46.7)Whole emphysema ratio (%)0.18 (0–0.57)4.62 (1.39–6.85)< 0.001Regional emphysema severity< 0.001 No376 (45.3)113 (13.4) Low319 (38.4)225 (26.6) High135 (16.3)507 (60.0)Regional emphysema ratio (%)0.10 (0–0.49)1.11 (0.33–3.32)< 0.001ILA severity< 0.001 Non-ILA359 (43.3)74 (8.8) Equivocal ILA402 (48.4)281 (33.2) Definite ILA69 (8.3)490 (58.0)ILA ratio (%)0.12 (0.01–0.60)5.21 (3.77–6.49)< 0.001Regional ILA severity< 0.001 No335 (40.4)94 (11.1) Low336 (40.5)204 (24.1) High159 (19.2)547 (64.7)Regional ILA ratio (%)0.15 (0.01–0.87)2.09 (1.00–5.72)< 0.001Unless otherwise indicated, numbers represent the count of patients, with percentages in parentheses. Whole emphysema ratio, regional emphysema ratio, ILA ratio and regional ILA ratio were reported as median and interquartile range. TNM staging was based on the 9th AJCC system for lung cancerp < 0.05 indicates a statistically significant differenceCEA carcinoembryonic antigen, CA125 carbohydrate antigen 125, TNM tumor-node-metastasis, FVC forced vital capacity, FEV1 forced expiratory volume in 1 s, DLCO diffusing capacity of lung for carbon monoxide, COPD chronic obstructive pulmonary disease, NSCLC non-small cell lung cancer
Emphysema and ILA assessment agreement
The Kappa agreement between the observer and AI for the presence of whole emphysema, whole emphysema severity, the presence of ILA and ILA severity was excellent, with κ values of 0.87, 0.87, 0.83 and 0.85, respectively. The correlation coefficients between the AI and visual assessment for the presence and severity of whole emphysema were 0.87 and 0.93, respectively. Those for the presence of ILA and ILA severity were 0.83 and 0.92, respectively.
Survival analysis
The Kaplan–Meier survival curves indicated that patients with emphysema, whether whole or regional, had worse survival than those without emphysema (Fig. 4A, B). The OS of patients with mild emphysema was better than that of patients with more than mild emphysema (p < 0.001), and patients with high regional emphysema had lower OS rates than those with low regional emphysema (p < 0.001). Similarly, patients without ILA had a favorable prognosis (Fig. 4C, D). Survival rates decreased as ILA severity and the extent of regional ILA increased (both p < 0.001).Fig. 4. Kaplan–Meier curves showing NSCLC patient 5-year overall survival rate. Whole emphysema severity (A), regional emphysema severity (B), ILA severity (C), and regional ILA severity (D). ILA, interstitial lung abnormality. p < 0.05 was considered to indicate significance
Multivariable analysis demonstrated that whole emphysema severity (mild: HR, 1.66 [95% CI: 1.26, 2.18]; p < 0.001; and more than mild: HR, 2.55 [95% CI: 1.88, 3.48]; p < 0.001), and ILA severity (equivocal ILA: HR 1.63 [95% CI: 1.15, 2.32]; p = 0.006; and definite ILA: HR 2.33 [95% CI: 1.61, 3.35]; p < 0.001) were the independent prognostic factors for NSCLC, while regional emphysema and regional ILA severity (both p > 0.05) were not (Table 2). A multiplicative interaction was observed between emphysema and ILA for the 5-year OS rate (HR 5.65 [95% CI: 4.84, 6.59]; p < 0.001). However, no additive interaction was observed (relative excess risk due to interaction, p = 0.07; and attributable proportion of interaction, p = 0.07) (Table 3).Table 2. Univariable and multivariable Cox regression analyses of clinicopathological characteristics, CT-defined emphysema and interstitial lung abnormality severity for overall survivalVariablesUnivariable analysis HR (95% CI)p-valueMultivariable analysis HR (95% CI)p-valueAge (years)1.03 (1.03, 1.04)< 0.0011.02 (1.01, 1.02)< 0.001Sex (Male) Female0.70 (0.61, 0.81)< 0.0011.16 (0.96, 1.40)0.12Smoking history (No) Yes1.44 (1.25, 1.64)< 0.0011.09 (0.92, 1.30)0.32Family history (No) Yes1.55 (1.19, 2.01)0.0011.267 (0.97, 1.65)0.08Hypertension (No) Yes1.32 (1.11, 1.58)0.0020.86 (0.71, 1.04)0.11Diabetes mellitus (No) Yes0.95 (0.75, 1.19)0.64Coronary artery calcification (No) Yes1.12 (0.90, 1.41)0.29CEA (ng/mL) (≤ 5) Increased (> 5)2.18 (1.90, 2.51)< 0.0010.86 (0.73, 1.01)0.06CA125 (U/mL) (≤ 15) Increased (> 15)2.22 (1.94, 2.55)< 0.0011.12 (0.96, 1.31)0.15Tumor size (cm) (≤ 3) > 32.07 (1.80, 2.38)< 0.0011.08 (0.93, 1.27)0.32TNM staging (stage I) II1.48 (0.99, 2.23)0.060.86 (0.56, 1.30)0.47 III12.32 (9.33, 16.27)< 0.0015.06 (3.70, 6.93)< 0.001 IV15.44 (11.83, 20.15)< 0.0016.49 (4.81, 8.76)< 0.001Treatment (Non-surgery) Surgery0.35 (0.30, 0.41)< 0.0010.64 (0.54, 0.75)< 0.001Histologic type (Adenocarcinoma) Squamous cell carcinoma1.82 (1.57, 2.11)< 0.0010.99 (0.83, 1.19)0.93 Other NSCLC1.33 (0.92, 1.92)0.130.86 (0.59, 1.27)0.46Tumor location (Peripheral) Central1.86 (1.61, 2.14)< 0.0011.10 (0.93, 1.29)0.28COPD status (Non-COPD) COPD2.75 (2.40, 3.16)< 0.0011.10 (0.95, 1.29)0.22Whole emphysema severity (Non-emphysema) Mild4.08 (3.37, 4.93)< 0.0011.66 (1.26, 2.18)< 0.001 More than mild7.30 (6.10, 8.73)< 0.0012.55 (1.88, 3.48)< 0.001Regional emphysema severity (No) Low2.04 (1.63, 2.56)< 0.0010.98 (0.74, 1.30)0.90 High5.60 (4.56, 6.88)< 0.0010.73 (0.53, 1.02)0.06ILA severity (Non-ILA) Equivocal ILA2.91 (2.25, 3.75)< 0.0011.63 (1.15, 2.32)0.006 Definite ILA10.47 (8.18, 13.40)< 0.0012.33 (1.61, 3.35)< 0.001Regional ILA severity (No) Low1.94 (1.52, 2.48)< 0.0010.94 (0.69, 1.30)0.72 High6.02 (4.83, 7.50)< 0.0011.31 (0.95, 1.80)0.10Data in parentheses are 95% CI. TNM staging was based on the 9th AJCC system for lung cancerp < 0.05 indicates a statistically significant differenceCEA carcinoembryonic antigen, CA125 carbohydrate antigen 125, TNM tumor-node-metastasis, COPD chronic obstructive pulmonary disease, ILA interstitial lung abnormality, NSCLC non-small cell lung cancer, HR hazard ratioTable 3Interaction between CT-defined emphysema and interstitial lung abnormality on 5-year overall survivalHRRERIAPModel5.6510.020.48p-value< 0.0010.070.07Multiplicative interaction is assessed using HR. RERI and AP indicate additive interactionp < 0.05 indicates a statistically significant differenceILA interstitial lung abnormality, RERI relative excess risk due to interaction, AP attributable proportion, HR hazard ratio
Sensitivity analysis
Supplementary Table S1 presents the results of sensitivity analyses excluding participants who died within the first year of follow-up, showing that regional ILA severity was not an independent predictor of OS (p > 0.05). Stratified analyses by age, sex, smoking history, and tumor size also showed no significant associations (all p > 0.05; Supplementary Table S1). Additionally, results remained materially unchanged after multiple imputation of covariates (p > 0.05; Supplementary Table S1). In summary, the sensitivity analyses were consistent with the primary findings, indicating that regional ILA severity was not an independent prognostic factor for NSCLC. Therefore, it was excluded from model construction and is unlikely to influence the predictive performance of the model.
Incremental prognostic value of emphysema and ILA
Model 7, incorporating TNM staging, whole emphysema and ILA severity, achieved superior predictive performance for the 5-year OS (AUC: 0.90; Table 4), significantly outperforming other models (DeLong test: all p < 0.001). It also showed the highest C-index (0.80) and the lowest AIC (719.02), indicating good model fit and robustness. Time-dependent ROC further confirmed the incremental prognostic value of AI-quantified emphysema and ILA severity (Fig. 5A). The model demonstrated excellent agreement between predicted and observed outcome (Fig. 5B) and showed favorable clinical utility (Fig. 5C). The performance of Models 1–6 is detailed in the Supplementary Materials.Fig. 5. Prognostic performance and clinical usefulness of models. A Time-dependent areas under the ROC curve of the models. B The calibration curve of Model 7 shows the agreement between predicted and observed 5-year survival rate. The Hosmer-Lemeshow goodness-of-fit test shows that the p-value for model 7 is greater than 0.05. C The decision curve analysis of Model 7. The y-axis measures the net benefit, while the x-axis represents the different probability thresholds of mortality risk. The gray line represents the assumption that all patients have died by the fifth year. The black line represents the assumption that no patients have died by the fifth year. The modeling factors are all independent prognostic factors. Model 1: TNM staging. Model 2: whole emphysema severity. Model 3: ILA severity. Model 4: TNM staging and whole emphysema severity. Model 5: TNM staging and ILA severity. Model 6: whole emphysema severity and ILA severity. Model 7: TNM staging, whole emphysema severity and ILA severity. ROC, the time-dependent receiver operating characteristic curve; ILA, interstitial lung abnormality; TNM, tumor-node-metastasisTable 4Additional predictive value of AI-quantified whole emphysema severity and interstitial lung abnormality severity in combination with TNM stagingAICConcordance index5-year AUCp-valueModel 1810.540.76 (0.75–0.77)0.85< 0.001Model 2848.640.71 (0.70–0.72)0.79< 0.001Model 3842.810.71 (0.71–0.72)0.79< 0.001Model 4800.360.78 (0.78–0.79)0.88< 0.001Model 5793.330.79 (0.79–0.80)0.89< 0.001Model 6825.290.76 (0.75–0.77)0.86< 0.001Model 7719.020.80 (0.80–0.81)0.90Data in parentheses are 95% CIs. The modeling factors are all independent prognostic factors. Model 1: TNM staging. Model 2: whole emphysema severity. Model 3: ILA severity. Model 4: TNM staging and whole emphysema severity. Model 5: TNM staging and ILA severity. Model 6: whole emphysema severity and ILA severity. Model 7: TNM staging, whole emphysema severity and ILA severity. TNM staging was based on the 9th AJCC system for lung cancerp-value for the comparison of the AUC of model 7 with models 1–6, using DeLong’s test. p < 0.05 indicates a statistically significant differenceTNM tumor-node-metastasis, AIC Akaike information criterion, AUC the area under the time-dependent receiver operating characteristic curve, ILA interstitial lung abnormality
Using Model 7, which demonstrated the best performance, we generated Shapley additive explanation plots to better visualize the contributions of the factors to the model (Fig. 6). ILA severity was found to be more important than whole emphysema severity in predicting NSCLC patient survival.Fig. 6. Shapley additive explanation plot visualization for selected predictors. The broader the horizontal bar, the more significant the observed effect. In the color bar on the right side, blue indicates low predictors, and red indicates high predictors. Each feature’s contribution is shown on the x-axis. A feature with a positive Shapley value increases the risk of death. Readers can infer the risk of death according to the specific value of each predictor. ILA, interstitial lung abnormality; TNM, tumor-node-metastasis
Impact of AI-quantified emphysema and ILA on surgical eligibility in stages I–IIIA
For stage IA patients, mild emphysema (OR, 0.41 [95% CI: 0.32, 0.57]; p < 0.001; Table 5), more than mild emphysema (OR, 0.40 [95% CI: 0.28, 0.68]; p < 0.001), equivocal ILA (OR 0.51 [95% CI: 0.40, 0.69]; p < 0.001) and definite ILA (OR 0.41 [95% CI: 0.31, 0.64]; p < 0.001) were independent predictors of surgical resection. Similar results were observed in patients with stages IB–IIIA NSCLC (mild emphysema: OR 0.53, [95% CI: 0.37, 0.87], p = 0.002; more than mild emphysema: OR 0.47, [95% CI: 0.31, 0.87], p = 0.005; equivocal ILA: OR 0.55, [95% CI: 0.39, 0.83], p < 0.001; and definite ILA: OR 0.50, [95% CI: 0.23, 0.72], p = 0.007).Table 5. Odds ratio (95% CI and p-value) of having received versus not received lung cancer surgical resection by clinicopathological characteristics, CT-defined emphysema severity, and interstitial lung abnormality severityAdjusted odds ratio (95% CI) of receiving vs. not receiving lung cancer surgical resectionVariablesStage IAStage IB–IIIAWhole emphysema severity (Non-emphysema) Mild0.41 (0.32, 0.57), p < 0.0010.53 (0.37, 0.87), p = 0.002 More than mild0.40 (0.28, 0.68), p < 0.0010.47 (0.31, 0.87), p = 0.005ILA severity (Non-ILA) Equivocal ILA0.51 (0.40, 0.69), p < 0.0010.55 (0.39, 0.83), p < 0.001 Definite ILA0.41 (0.31, 0.64), p < 0.0010.50 (0.23, 0.72), p = 0.007Data in parentheses are 95% CISeparate analyses were performed for stage IA and stage IB–IIIA. Adjusted for family history, coronary artery calcification, and CEA in stage IA and adjusted for CEA and CA125 in stage IB–IIIA. p < 0.05 indicates a statistically significant differenceCEA carcinoembryonic antigen, CA125 carbohydrate antigen 125, ILA interstitial lung abnormality
Stratified analysis of OS by COPD status, treatment type and TNM staging
Patients were divided into three subgroups: (1) non-COPD and COPD; (2) non-surgery and surgery; and (3) stages I–IIIA and stages IIIB–IV (Supplementary Tables S2, 3 and Supplementary Figs. S1–6). The Supplementary Materials provide detailed emphysema and ILA impact on prognosis in different subgroups.
Discussion
To date, NSCLC prognosis stratification research has primarily focused on TNM staging, while lung cancer comorbidities have been overlooked [4]. In our study, AI-quantified whole emphysema severity and ILA severity were independent prognostic factors, while regional emphysema severity, regional ILA severity and COPD were not. Furthermore, AI-quantified whole emphysema and ILA severity could provide additional prognostic value with respect to TNM staging alone.
The presence of emphysema and ILA by visual assessment was associated with worse OS in NSCLC [13, 35]. However, visual assessment has limitations, including subjectivity and time-consuming, making objective quantification an ideal assistive task for AI [36]. In our study, AI-based assessment of emphysema and ILA showed excellent agreement with experienced radiologists, supporting the feasibility and reliability of AI for objective imaging evaluation. Kaplan–Meier survival curve and multivariable analyses revealed that NSCLC with more than mild emphysema and definite ILA had a higher mortality risk. A higher emphysema grade is associated with higher levels of matrix metalloproteinases and leads to a worse prognosis [37, 38]. Moreover, ILA can represent early pulmonary fibrosis in certain conditions [39]. Patients with definite ILA typically exhibit more severe lung tissue damage and fibrosis than those with equivocal ILA, leading to poorer prognosis [40]. Our study also observed a multiplicative interaction between emphysema and ILA, which has a significant synergistic negative impact on patient survival. Emphysema impairs gas exchange, while ILA reduces diffusing capacity. Their synergistic effects can severely compromise lung function [41]. Furthermore, the opposing physiological effects of emphysema and ILA may obscure functional impairment, delaying diagnosis and treatment, leading to worse outcomes in NSCLC [41]. However, there is no additive interaction between emphysema and ILA. One potential explanation is that changes in DNA methylation and histone modifications, telomere dysfunction and shortening are shared pathogenic mechanisms underlying both emphysema and pulmonary fibrosis [41].
Interestingly, our study found that regional emphysema severity was not an independent prognostic factor in a cohort including patients with stage I–IV, which was inconsistent with previous studies focusing on stages I–II [42]. Furthermore, COPD was also not an independent prognostic factor in our study. Recent studies have shown that some individuals exhibit emphysema and ILA in CT images before being diagnosed with COPD [26, 43]. This indicated that lung function impairment is associated with NSCLC patient outcome regardless of whether it meets the clinical criteria for a COPD diagnosis.
In our study, C-index, AIC and calibration analyses consistently showed that Model 7 outperformed the other models, demonstrating the best goodness-of-fit and robustness. These findings also confirm the added value of AI-quantified emphysema and ILA severity. Therefore, these quantified metrics should be considered in prognosis predictions for NSCLC patients in clinical practice to optimize treatment and monitoring. Moreover, the Shapley additive explanation plots of Model 7 showed that ILA is more important than emphysema in predicting NSCLC survival. This might be because emphysema involves the destruction of alveolar walls, whereas ILA is considered a potentially progressive interstitial lung disease [11, 44]. Interstitial lung disease frequently results in end-stage respiratory failure, particularly pulmonary fibrosis, increase in lung stiffness and the development of scarring [44, 45]. These features indicate that ILA may be related to worse OS than emphysema.
Stages I–IIIA NSCLC patients with emphysema and ILA were less likely to undergo surgery than no emphysema and ILA. Surgical resection remains the standard treatment for stages I–IIIA NSCLC; however, those who also have comorbidities are frequently considered inoperable due to low cardiopulmonary reserve [5, 46]. In clinical practice, patients who are not suitable for surgery may need to consider alternative treatment options, such as stereotactic radiation, targeted therapy and immunotherapy [47]. Consequently, the impact of emphysema and ILA on treatment decisions in patients with stages I–IIIA NSCLC must be fully considered, as this will facilitate the development of individualized treatment plans and improve patients’ prognosis.
In the COPD subgroup, although ILA severity stratified patients by risk in the Kaplan–Meier analysis, it was not an independent prognostic factor. This result is potentially due to shared genome segments, including FAM13A, DSP, and 17q21, between COPD and ILA, where these variants increase the risk of COPD but protect against ILA [44]. Whole emphysema severity was correlated with the mortality rate but was not an independent prognostic factor in patients who underwent surgery. The potential explanation is that patients who underwent surgery had a lower incidence of emphysema; furthermore, postoperative survival is affected by other factors, including age and reduced pulmonary function [48]. Additionally, whole emphysema and ILA severity were independent prognostic factors for NSCLC stage I–IIIA and IIIB–IV patients. Therefore, incorporating AI-quantified emphysema and ILA into the model can optimize the prognostic model’s prediction performance and improve survival risk stratification in NSCLC.
Our study has some limitations. This retrospective single-center study might be subject to potential selection and inherent biases. Therefore, future multicenter prospective clinical studies are needed to confirm our findings. Furthermore, while we investigated the OS outcome of NSCLC patients, the specific cause of mortality was not identified. Our next study will investigate specific causes of mortality in NSCLC patients. Finally, we used X-tile to determine the cutoff value for regional ILA severity since no reference cutoff is available; further validation in diverse populations is necessary.
In conclusion, AI-quantified emphysema and ILA severity not only independently predict increased mortality risk in NSCLC patients but also provide complementary prognostic value to TNM staging. These parameters, derived from baseline CT scans, serve as potential early markers of survival that can help stratify mortality risk in NSCLC patients. Future work will further evaluate the generalizability of these findings.
Supplementary information
ELECTRONIC SUPPLEMENTARY MATERIAL
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Zhang Y, Yi J, Sun D et al (2023) The association of chest computed tomography-defined visual emphysema and prognosis in patients with nonsmall cell lung cancer. ERJ Open Res 9:00195-202310.1183/23120541.00195-2023 PMC 1065864238020560 · doi ↗ · pubmed ↗
- 2Haesevoets S, Arents E, Cops D et al (2025) The impact of lung surgery, with or without (neo-)adjuvant therapy, on physical functioning in patients with nonsmall cell lung cancer: a scoping review. Eur Respir Rev 34:24015610.1183/16000617.0156-2024 PMC 1183667039971396 · doi ↗ · pubmed ↗
- 3Beghe B, Cerri S, Fabbri LM, Marchioni A (2021) COPD, pulmonary fibrosis and IL As in aging smokers: the paradox of striking different responses to the major risk factors. Int J Mol Sci 22:929210.3390/ijms 22179292 PMC 843091434502194 · doi ↗ · pubmed ↗
