# Based on single-cell and transcriptome analysis of inflammatory pathway biomarkers and their molecular mechanisms in chronic obstructive pulmonary disease

**Authors:** Yaping Zhou, Hui Gong, Zelin Hao, Lu Wang, Li Li, Xiaoguang Zou

PMC · DOI: 10.1371/journal.pone.0343798 · PLOS One · 2026-02-25

## TL;DR

This study identifies key genes and cell types involved in COPD inflammation using single-cell and transcriptome data, offering new insights for precision therapies.

## Contribution

The novel integration of transcriptomic and single-cell analyses reveals key genes and macrophage dynamics in COPD inflammation.

## Key findings

- CXCL12, CXCR4, GGT1, and VWF are key genes in COPD pathogenesis.
- Macrophages are the only cell type with significant abundance differences in COPD.
- CXCR4 shows persistent expression throughout macrophage differentiation.

## Abstract

Systemic inflammation in chronic obstructive pulmonary disease (COPD) presents significant therapeutic challenges. Our study employs integrated transcriptomic and single-cell analyses to identify inflammation-related biomarkers and elucidate their pathogenic mechanisms in COPD.

Training dataset GSE37768, validation dataset GSE239897, and single-cell dataset GSE249584 were retrieved from the GEO database. Inflammation-associated genes were screened from the GeneCards database. Differential expression analysis was employed to identify candidate genes, followed by machine learning approaches and expression validation to pinpoint key genes. Functional characterization of these key genes was conducted through Gene Set Enrichment Analysis (GSEA), immune infiltration profiling, molecular regulatory network construction, drug prediction, and GeneMANIA interaction analysis. Single-cell data analysis elucidated cellular heterogeneity and identified critical cell types. Pseudotime analysis was subsequently performed to investigate the roles of key genes throughout developmental trajectories within these critical cell types.

Twelve candidate genes associated with COPD and inflammation were screened, followed by GO and KEGG enrichment analyses. Subsequently, Least Absolute Shrinkage and Selection Operator (LASSO) regression and Support Vector Machine-Recursive Feature Elimination (SVM-RFE) modeling identified six candidate key genes. Among these, only CXCL12, CXCR4, GGT1, and VWF exhibited consistent expression patterns across both training and validation datasets, establishing them as key genes. Their diagnostic value was further validated by constructing an artificial neural network model. Immune infiltration analysis revealed aberrant basophil abundance in COPD. Single-cell analysis annotated 11 distinct cell types, with macrophages representing the sole cell type demonstrating significant abundance differences between COPD and control groups. Pseudotime trajectory analysis delineated nine differentiation states, wherein CXCR4 expression persisted throughout the cellular differentiation trajectory.

This study identified CXCL12, CXCR4, GGT1, and VWF as key genes in COPD pathogenesis. Macrophages constituted the only cell type exhibiting significant abundance alterations, with CXCR4 demonstrating persistent expression throughout macrophage differentiation trajectories. These findings provide valuable insights and suggest potential directions for developing precision therapeutic strategies for COPD.

## Linked entities

- **Genes:** CXCL12 (C-X-C motif chemokine ligand 12) [NCBI Gene 6387], CXCR4 (C-X-C motif chemokine receptor 4) [NCBI Gene 7852], GGT1 (gamma-glutamyltransferase 1) [NCBI Gene 2678], VWF (von Willebrand factor) [NCBI Gene 7450]
- **Diseases:** chronic obstructive pulmonary disease (MONDO:0005002)

## Full-text entities

- **Genes:** CXCR6 (C-X-C motif chemokine receptor 6) [NCBI Gene 10663] {aka BONZO, CD186, CDw186, STRL33, TYMSTR}, CYBA (cytochrome b-245 alpha chain) [NCBI Gene 1535] {aka CGD4, p22-PHOX}, PTPN22 (protein tyrosine phosphatase non-receptor type 22) [NCBI Gene 26191] {aka LYP, LYP1, LYP2, PEP, PTPN22.5, PTPN22.6}, CXCL12 (C-X-C motif chemokine ligand 12) [NCBI Gene 6387] {aka IRH, PBSF, SCYB12, SDF1, TLSF, TPAR1}, GGT1 (gamma-glutamyltransferase 1) [NCBI Gene 2678] {aka CD224, D22S672, D22S732, GGT, GGT 1, GGTD}, CXCR4 (C-X-C motif chemokine receptor 4) [NCBI Gene 7852] {aka CD184, D2S201E, FB22, HM89, HSY3RR, LCR1}, CMPK1 (cytidine/uridine monophosphate kinase 1) [NCBI Gene 51727] {aka CK, CMK, CMPK, UMK, UMP-CMPK, UMPK}, NOD2 (nucleotide binding oligomerization domain containing 2) [NCBI Gene 64127] {aka ACUG, BLAU, BLAUS, CARD15, CD, CLR16.3}, NLRP1 (NLR family pyrin domain containing 1) [NCBI Gene 22861] {aka AIADK, CARD7, CIDED, CLR17.1, DEFCAP, DEFCAP-L/S}, TP63 (tumor protein p63) [NCBI Gene 8626] {aka AIS, B(p51A), B(p51B), EEC3, KET, LMS}, NLRP3 (NLR family pyrin domain containing 3) [NCBI Gene 114548] {aka AGTAVPRL, AII, AVP, C1orf7, CIAS1, CLR1.1}, IL17RA (interleukin 17 receptor A) [NCBI Gene 23765] {aka CANDF5, CD217, CDw217, IL-17RA, IL17R, IMD51}, HLA-DQB1 (major histocompatibility complex, class II, DQ beta 1) [NCBI Gene 3119] {aka CELIAC1, HLA-DQB, IDDM1}, CD4 [NCBI Gene 100136285], BDNF (brain derived neurotrophic factor) [NCBI Gene 627] {aka ANON2, BULN2}, VWF (von Willebrand factor) [NCBI Gene 7450] {aka F8VWF, VWD}, KRT5 (keratin 5) [NCBI Gene 3852] {aka CK5, DDD, DDD1, EBS1, EBS2, EBS2A}
- **Diseases:** REGULATION (MESH:C564833), ENDOTHELIAL (MESH:D005642), COLLAGEN FIBRIL (MESH:D014693), immune dysregulation (OMIM:614878), Tuberculosis (MESH:D014376), CONTAINING (MESH:C565447), ENDOPLASMIC RETICULUM (MESH:D008228), vascular injury (MESH:D057772), COVID-19 (MESH:D000086382), cardiovascular complications (MESH:D002318), blood coagulation (MESH:D001778), COTRANSLATIONAL PROTEIN TARGETING (MESH:D011488), atherosclerosis (MESH:D050197), Coronavirus disease (MESH:D018352), MITOCHONDRIAL PROTEIN (MESH:C565376), ACTIN (MESH:C579880), SMELL (MESH:D000857), MITOCHONDRIAL TRANSLATION (OMIM:614922), airway obstruction (MESH:D000402), COPD inflammation (MESH:D011014), COPD (MESH:D029424), CYTOSOLIC RIBOSOME (MESH:C536654), PROTEIN LOCALIZATION (MESH:D004828), lung function decline (MESH:D055370), Asthma (MESH:D001249), SUBUNIT (MESH:C580233), endothelial dysfunction (MESH:D014652), MITOCHONDRIAL (MESH:D028361), respiratory disease (MESH:D012140), Inflammation (MESH:D007249)
- **Chemicals:** fatty acid (MESH:D005227), muramyl dipeptide (MESH:D000119)
- **Species:** Homo sapiens (human, species) [taxon 9606], Nicotiana tabacum (American tobacco, species) [taxon 4097]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12935203/full.md

## Figures

6 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12935203/full.md

## References

49 references — full list in the complete paper: https://tomesphere.com/paper/PMC12935203/full.md

---
Source: https://tomesphere.com/paper/PMC12935203