Prediction of differentiation levels in lung adenocarcinoma using peripheral blood inflammatory cytokines and tumor markers
Yang Li, Jiahuan Wu, Meiling Long, Tingting Zeng, Depeng Jiang, Zhiling Yu, Zhiling Yu, Zhiling Yu, Zhiling Yu

TL;DR
This study shows that combining blood markers can help predict the aggressiveness of lung adenocarcinoma, improving treatment planning.
Contribution
A novel model combining inflammatory and tumor markers to predict lung adenocarcinoma differentiation levels.
Findings
AISI, CEA, ferritin, and ProGRP were identified as risk factors for low differentiation LUAD.
The model achieved an AUC of 0.795 for distinguishing low from high differentiation LUAD.
Decision curve analysis confirmed the model's clinical utility for personalized treatment planning.
Abstract
Lung Adenocarcinoma (LUAD) has highly aggressive and lethal, and its degree of differentiation significantly influences prognosis and treatment strategies, yet accurate prediction remains challenging. To assess the predictive value of combining peripheral blood inflammatory markers, such as the aggregate index of systemic inflammation (AISI), with tumor markers, including Carcinoembryonic Antigen (CEA) and Cytokeratin 19 fragment antigen 21−1(CYFRA21−1), etc, for determining LUAD differentiation levels. This retrospective study included 203 LUAD patients treated at Chongqing Medical University’s Second Affiliated Hospital, categorized by low and high differentiation. Demographic, clinical, and laboratory data including peripheral blood inflammatory and tumor markers were analyzed. A multivariate logistic regression model evaluated these markers’ predictive accuracy. AISI (OR = 1.64,…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Fig 1
Fig 2
Fig 3
Fig 4
Fig 5
Fig 6Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsLung Cancer Research Studies · Ferroptosis and cancer prognosis · Inflammatory Biomarkers in Disease Prognosis
Introduction
Lung cancer is a leading cause of cancer incidence and mortality globally, posing a serious threat to human health. According to global cancer statistics (GLOBOCAN), approximately 2.2 million new lung cancer cases and 1.8 million related deaths occurred in 2020 [1]. Lung cancer is broadly divided into small cell lung cancer and non-small cell lung cancer (NSCLC), with NSCLC making up over 85% of cases. Lung Adenocarcinoma (LUAD), the most prevalent NSCLC subtype, accounts for around 40% of cases, followed by squamous cell carcinoma at 25%. Despite advances in diagnosis and treatment, LUAD remains highly aggressive and lethal, with a five-year survival rate below 5% [2]. Factors such as tumor size, differentiation level, Tumor Node Metastasis classification(TNM) stage, and lymph node metastasis significantly influence LUAD treatment outcomes and survival.
The connection between inflammation and cancer has become a primary focus in recent research. Inflammatory responses are crucial for tissue repair after tumor-induced damage and can influence tumor microenvironment stability, exhibiting both tumor-suppressive and tumor-promoting effects [3]. Studies indicate that systemic inflammatory markers, such as neutrophil-to-lymphocyte ratio (NLR) and platelet-to-lymphocyte ratio (PLR), are closely linked with tumor invasion, recurrence, metastasis, and prognosis [4–5]. These ratios are key indicators of systemic inflammation and strongly correlate with cancer outcomes [6–7]. Given LUAD’s high mortality and complex pathology, identifying precise diagnostic and prognostic tools is essential [8]. Tumor differentiation plays a central role in assessing treatment response and survival, requiring accurate evaluation to guide personalized therapy. However, traditional histopathology faces limitations like subjectivity and delayed results in evaluating tumor differentiation [9].
This study evaluated the clinical utility of combining peripheral blood inflammatory markers aggregate index of systemic inflammation (AISI) with specific tumor markers to predict differentiation levels in LUAD. We hypothesized that particular marker combinations may more accurately reflect the biological characteristics of LUAD, offering clinicians a novel, non-invasive tool for assessing tumor differentiation. This method could improve treatment decisions by enhancing specificity and therapeutic effectiveness.
Methods and materials
Study population
This retrospective study included 203 lung adenocarcinoma patients consecutively admitted to the Second Affiliated Hospital of Chongqing Medical University from June 1, 2022, to July 30, 2024. Data were accessed between December 1, 2024 and January 1, 2025. This study utilized anonymized medical records from the Second Affiliated Hospital of Chongqing Medical University. Inclusion criteria were as follows: age 18–90 years, LUAD was initially confirmed by pathological diagnosis and complete clinical and follow-up data. Exclusion criteria included: age < 18 or >90 years, lack of pathological evidence for malignant lung tumors, non-adenocarcinoma lung cancers, prior anti-tumor treatment, undefined pathological stage or differentiation level, and incomplete clinical or follow-up data. The study focused on patients with low and high differentiation levels, so cases with other differentiation levels were excluded. This resulted in two groups: low differentiation and high differentiation. The patient selection process is shown in Fig 1 [10]. All personal identifiers were removed prior to researcher access, and this retrospective study was conducted in accordance with the ethical principles of the Declaration of Helsinki and was approved by the Second Affiliated Hospital of Chongqing Medical University Ethics Committee (No.85, 2024), with a waiver of informed consent obtained.
The procedure of the study.
Definition of low and high differentiation
Low differentiated LUAD refers to cancer cells with minimal differentiation, showing marked deviations from the epithelial cells of normal lung glands. These cells exhibit irregular shapes, vary in size, have enlarged and darkly stained nuclei, and feature an increased nucleus-to-cytoplasm ratio, indicating a loss of normal cellular characteristics (Figure SIa). High differentiated LUAD consists of cancer cells that are highly differentiated, closely resembling normal lung gland epithelial cells in morphology. These cells appear regular, with smaller nuclei and a normal nucleus-to-cytoplasm ratio (Figure SIb) [11].
AISI calculation and pathological differentiation definition
The AISI score was calculated using the formula: neutrophils×platelets× monocytes/lymphocytes [6].
Statistics analysis
For the examination of categorical variables, the chi-square test or Fisher’s exact test was chosen as the preferred method. To assess continuous variables, both the Mann-Whitney U test and the independent samples T test were employed. Due to the binary nature of our dependent variable, logistic regression models were employed. Multivariate binary logistic regression models were constructed based on univariate logistic regression results. The performance of the model was evaluated through discrimination, as measured by the Area Under the Receiver Operating Characteristic Curve (ROC). Clinical utility was evaluated using decision curve analysis (DCA), and internal validation was conducted via bootstrapping with 1,000 repetitions. Statistical analyses were performed using R version 4.4.3, with P-value < 0.05 considered statistically significant.
Results
Baseline characteristics of study population
The study retrospectively analyzed 203 patients with lung adenocarcinoma, including 109 men (53.69%) and 94 women (46.31%). Most patients did not report smoking or drinking habits, and a minority had chronic conditions such as diabetes (18 patients, 8.87%), hypertension (54 patients, 26.6%), and coronary heart disease (10 patients, 4.93%). All cases were pathologically confirmed as lung adenocarcinoma and classified into stages 0–4 according to American Joint Committee on Cancer Tumor-Node-Metastasis Staging System(AJCC TNM Staging). Based on pathologist evaluations, patients were categorized into low-differentiation (159 patients, 78.33%) and high-differentiation (44 patients, 21.67%) groups. Table 1 presents the detailed baseline characteristics of the cohort.
Table 1: Baseline characteristics.
Table 2 presents a descriptive analysis of the demographic characteristics, routine peripheral blood test results, and standard lung cancer tumor markers for the study population.The average age was 62.10 ± 11.54 years, and the mean body mass index (BMI) was 22.96 ± 3.40. The mean AISI was 5.81 ± 1.03, with a median of 5.77. Median values for tumor markers were as follows: Carcinoembryonic Antigen (CEA) at 7.16, Cytokeratin 19 fragment antigen 21−1(CYFRA21−1) at 2.97, ferritin at 143.06, Neuron Specific Enolase (NSE) at 13.86, Squamous Cell Carcinoma Antigen (SCCA) at 0.70, Tissue Polypeptide Antigen (TPA) at 103.6, and Progastrin Releasing Peptide (ProGRP) at 31.68.
Table 2: Analysis of the demographic characteristics.
Analysis of LUAD patients with different differentiation levels
In this study, LUAD differentiation was classified by pathologists based on the resemblance between tumor cells and normal lung tissue cells, dividing patients into low differentiation (159 patients, 78.33%) and high differentiation groups (44 patients, 21.67%). Findings are summarized in Table 3. Baseline characteristics showed no statistically significant differences between the groups (P > 0.05). However, significant differences were observed in laboratory test results, including neutrophils (P = 0.001), monocytes (P = 0.002), platelets (P = 0.008), ln-AISI (P = 0.002), CEA (P < 0.001), CYFRA21–1 (P < 0.001), ferritin (P < 0.001), NSE (P = 0.016), TPA (P < 0.001), and ProGRP (P = 0.004). Due to the limited number of low-differentiated patients in stages 0–2 (only one case), statistical analysis was not feasible for this subgroup. Therefore, the primary analysis focused on stages 3–4, with detailed results presented in the supplementary materials and baseline characteristics shown in S1 Table in S1 File.
Table 3: Baseline characteristics and laboratory examinations of two groups.
Due to skewed laboratory test data, the Mann-Whitney U test was applied (Table 4), revealing statistically significant differences in CEA, CYFRA21–1, and TPA values (p < 0.05).The skewed laboratory test data for the stage 3–4 group is presented in Table S2 in S1 Table.
Table 4: Nonnormal data of two groups.
The multivariate logistic regression model assessed the probability of low differentiation in lung adenocarcinoma patients (Table 5). Key influencing factors were identified through a backward stepwise regression analysis, and the results are presented in Fig 2. Key predictors included AISI (OR = 1.64, 95% CI = 1.08–2.58, p = 0.024), CEA (OR = 1.02, 95% CI = 1.00–1.04, p = 0.0497), Ferritin (OR = 1.01, 95% CI = 1.00–1.01, p = 0.010), and ProGRP (OR = 1.03, 95% CI = 1.00–1.07, p = 0.047). To develop a nomogram for predicting the probability of low differentiation in lung cancer based on the optimized model (Fig 3), with no multicollinearity among the variables (Supplementary Table 3). The result showed that higher AISI, higher CEA, higher ferritin and higher ProGRP were risk factors. This nomogram offered a visual point – based system that is established on AISI, CEA, ferritin, and ProGRP to estimate the likelihood of poorly differentiated lung cancer. To calculate the probability of poorly differentiated lung cancer, the points assigned to each of the four variables, as determined on the scale, are summed up to obtain a total score. A vertical line is then drawn from the total point scale to the final axis, and the corresponding probability of poorly differentiated lung cancer can be obtained. The model achieved an AUC of 0.795 (95% CI: 0.726–0.864), as shown in Fig 4. The calibration curve was plotted using the Bootstrap method with 1,000 repetitions. The dashed diagonal line represents the ideal line, which signifies a perfect prediction. The solid line indicates the actual prediction performance of the model. The closer these two lines are to each other, the better the model’s predictive accuracy. The results are presented in Fig 5. The absolute error between the simulated curve and the actual curve is 0.012, demonstrating that the two curves follow a highly similar trend and exhibit strong consistency. The decision curve for the established nomogram was constructed. The clinical value of this model increases as its curve moves away from the “treat all” or “treat none” curves. The threshold at which patients benefit from this model ranges from 30% to 95%. Furthermore, the decision curve analysis depicted in Fig 6 confirms the clinical utility of the model, demonstrating a favorable net benefit rate. This analysis further validates the model’s effectiveness in clinical practice.
Table 5: The multiple logistic regression model.
The optimized multiple logistic regression model (backward stepwise regression).(The odds ratio (OR) > 1.00, indicating that the variable is associated with an increased risk of poorly differentiated LUAD. AISI:Aggregate Index of Systemic Inflammation;CEA: Carcinoembryonic Antigen;ProGRP: Progastrin-Releasing Peptide).
The nomogram model to predict low differentiation in lung cancer.(Using the nomogram, clinicians can derive a score based on individual patient characteristics. This total score is then translated into a probability of risk of poorly differentiated LUAD, facilitating stratified management and personalized interventions for higher risk patients. AISI: Aggregate Index of Systemic Inflammation; CEA: Carcinoembryonic Antigen; ProGRP: Progastrin-Releasing Peptide).
The AUC of the nomogram model.( (AUC is 0.795 (95% CI: 0.726 - 0.864), indicateing that the model possesses a relatively good discriminatory ability and accuracy).
The calibration curve of the nomogram model.(The calibration curve was generated using 1,000 bootstrap repetitions. The diagonal dashed line represents the ideal case of perfect prediction, while the solid line indicates the actual performance of our model. Closer agreement between the two lines signifies better predictive accuracy).
The decision curve of the nomogram model.(The nomogram offered a greater net benefit compared to both the “treat-all” and “treat-none” approaches over a threshold probability range from 30% to 95%, confirming its clinical utility in routine practice).
Discussion
This study demonstrated that peripheral blood biomarkers and tumor biomarkers were closely associated with pathological differentiation in LUAD. By integrating these markers, we developed a predictive model with an AUC of 0.795 (95%CI: 0.726–0.864), outperforming traditional indices such as SIRI (AUC:0.625) and LDH (AUC:0.596). The findings align with recent advances in biomarker-driven oncology, which emphasize the synergistic role of tumor biology and systemic inflammation in cancer progression [12–15].
The differentiation status of LUAD is a critical prognostic factor. Well differentiated tumors typically exhibit slow growth and localized spread, leading to higher patient survival rates and more favorable treatment outcomes. In contrast, poorly differentiated tumors are characterized by high aggressiveness, early metastasis, and heightened resistance to therapy [16–19]. Our model’s ability to noninvasively predict differentiation addresses a clinical gap by reducing reliance on invasive biopsies, thereby improve detection rates and reduce healthcare costs. This aligns with NCCN guidelines advocating for non-invasive biomarker integration in NSCLC management [20–24]. However, as a pathological feature, the biological mechanism of differentiation degree may involve the dynamic interaction between tumor microenvironment (TME) and angiogenesis. The heterogeneity of TME may affect the differentiation status of LUAD [25]. Well differentiated LUAD usually presents with intact glandular structure and low nuclear atypia, while poorly differentiated LUAD is often accompanied by an increase in micropapillary/solid components, suggesting the difference between immune cell infiltration and matrix remodeling in the TME. The potential mechanisms are as follows: (1) Inflammatory cell infiltration: Poorly differentiated LUAD may promote the formation of the immunosuppressive microenvironment by recruiting M2-type macrophages or regulatory T cells (Tregs), and inhibit tumor differentiation-related signaling pathways (such as Wnt/β-catenin) [26–27]. (2) Extracellular matrix (ECM) remodeling: In poorly differentiated tumors, fibroblast activation is enhanced, secreting large amounts of collagen and hyaluronic acid, which may interfere with the epithelial cell differentiation process through mechanical pressure or signaling molecules (such as TGF-β) [28]. (3) Metabolic reprogramming: Poorly differentiated LUAD may rely on the glycolytic pathway (Warburg effect), leading to lactic acid accumulation and inhibiting the reverse process of epithelial-mesenchymal transition (EMT), thereby maintaining an undifferentiated state [29]. Previous studies had shown that inflammatory markers such as AISI were associated with the prognosis of LUAD, but few studies on the differentiation mechanism.
Inflammatory markers, including AISI and neutrophils, reflect a tumor microenvironment that promotes angiogenesis and immune evasion. This is complemented by tumor biomarkers such as CEA and CYFRA 21–1, which directly quantify malignant cellular activity. This integrative approach provides a holistic view of LUAD behavior, consistent with studies highlighting the interplay between inflammation and tumor differentiation [21,30–32]. Notably, the elevated CYFRA21–1 levels observed in poorly differentiated LUAD may stem from increased cytokeratin-19 fragment release during rapid tumor proliferation, a phenomenon previously reported in undifferentiated malignancies [33–35].
Secondary markers such as ferritin and ProGRP offer additional insights into LUAD heterogeneity and improve model performance. Elevated ferritin suggests dysregulated iron metabolism, which can promote tumor progression through oxidative stress and hypoxia adaptation [36–38]. Similarly, ProGRP detection in LUAD indicates an overlap with neuroendocrine mechanisms, warranting further investigation into its utility for subtype classification [39–41].
However, limitations must be acknowledged. First, although multicollinearity tests were employed during model construction to reduce the risk of overfitting, this risk remains in studies with limited sample sizes. Second, the retrospective data used in this study were solely sourced from the Second Affiliated Hospital of Chongqing Medical University, which may introduce potential geographic and population biases. We acknowledged that the model developed from this single center dataset may have limited generalizability, and its performance could decline when applied to populations from other regions, with different ethnic backgrounds, or under varying healthcare settings. Third, we recognized that a truly robust model requires validation with data from other independent centers, but the step had not yet been accomplished in this study. To directly address the limitations identified in this study, we propose several key directions for future research. First, multi center collaborations should be initiated to conduct rigorous external validation of our current model using independent datasets from various institutions. This is an essential step for confirming the model’s clinical utility and generalizability. Second, large scale prospective cohort studies are warranted to elucidate the temporal relationship between AISI and the incidence and progression of lung adenocarcinoma. Third, integrating more precise diagnostic tools for lung adenocarcinoma differentiation could enhance the accuracy and reliability of research findings. Furthermore, developing predictive models that incorporate AISI with other biomarkers would help improve prognostic accuracy. Finally, fundamental experimental research focusing on areas such as tumor microenvironment analysis and single-cell studies should be pursued to progressively address the current limitations of the research.
Conclusion
This study suggested that inflammatory factors may serve as potential predictors for the differentiation of LUAD. By integrating inflammatory and tumor biomarkers, our predictive model effectively assessed pathological differentiation, aided in personalized treatment planning, and offered a reliable foundation for clinical decisions. Future studies should focued on elucidating the underlying biological mechanisms and validating the model’s clinical efficacy.
Supporting information
S1 FigSupplementary figures.(DOCX)
S1 TableSupplementary tables.(DOCX)
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Sung H, Ferlay J, Siegel RL, Laversanne M, Soerjomataram I, Jemal A, et al. Global Cancer Statistics 2020: GLOBOCAN Estimates of Incidence and Mortality Worldwide for 36 Cancers in 185 Countries. CA Cancer J Clin. 2021;71(3):209–49. doi: 10.3322/caac.21660 33538338 · doi ↗ · pubmed ↗
- 2Denisenko TV, Budkevich IN, Zhivotovsky B. Cell death-based treatment of lung adenocarcinoma. Cell Death Dis. 2018;9(2):117. doi: 10.1038/s 41419-017-0063-y 29371589 PMC 5833343 · doi ↗ · pubmed ↗
- 3Malik S, Waquar S, Idrees N, Malik A. Impending role of inflammatory markers and their specificity and sensitivity in breast cancer patients. Sci Rep. 2024;14(1):15117. doi: 10.1038/s 41598-024-65821-8 38956273 PMC 11219843 · doi ↗ · pubmed ↗
- 4Nierengarten MB. Cancer Statistics 2024: Deaths drop, incidences increase, prevention needed. Cancer. 2024;130(11):1904. doi: 10.1002/cncr.35347 38757629 · doi ↗ · pubmed ↗
- 5Bizuayehu HM, Ahmed KY, Kibret GD, Dadi AF, Belachew SA, Bagade T, et al. Global disparities of cancer and its projected burden in 2050. JAMA Netw Open. 2024;7(11):e 2443198. doi: 10.1001/jamanetworkopen.2024.43198 39499513 PMC 11539015 · doi ↗ · pubmed ↗
- 6Wang H-K, Wei Q, Yang Y-L, Lu T-Y, Yan Y, Wang F. Clinical usefulness of the lymphocyte-to-monocyte ratio and aggregate index of systemic inflammation in patients with esophageal cancer: a retrospective cohort study. Cancer Cell Int. 2023;23(1):13. doi: 10.1186/s 12935-023-02856-3 36707809 PMC 9881346 · doi ↗ · pubmed ↗
- 7Wang L, Li X, Liu M, Zhou H, Shao J. Association between monocyte-to-lymphocyte ratio and prostate cancer in the U.S. population: a population-based study. Front Cell Dev Biol. 2024;12:1372731. doi: 10.3389/fcell.2024.1372731 38645410 PMC 11026607 · doi ↗ · pubmed ↗
- 8Zheng Y, Tian H, Zhou Z, Xiao C, Liu H, Liu Y, et al. A novel immune-related prognostic model for response to immunotherapy and survival in patients with lung Adenocarcinoma. Front Cell Dev Biol. 2021;9:651406. doi: 10.3389/fcell.2021.651406 33816503 PMC 8017122 · doi ↗ · pubmed ↗
