# Multi-omics data-based modeling reveals tumorigenesis- and prognosis-associated genes with clinical potential in lung adenocarcinoma

**Authors:** Zhendong Lu, Pengfei Bao, Taiwei Wang, Kairui Hu, Lina Zhang, Ling Yi, Yuanming Pan, Weiying Li, Zhi John Lu, Jinghui Wang, Junzhong Ruan

PMC · DOI: 10.1186/s12885-025-14943-x · BMC Cancer · 2025-11-10

## TL;DR

This study uses multi-omics data to identify genes linked to lung adenocarcinoma development and prognosis, offering potential biomarkers and drug targets.

## Contribution

The novel integration of ATAC-seq and RNA-seq data identifies 14 genes with clinical potential for lung adenocarcinoma.

## Key findings

- 337 consensus genes were identified by integrating chromatin and mRNA data.
- Nine predictive and five prognostic genes were validated using external datasets and cfRNA analysis.
- The genes show potential for early detection and as therapeutic targets in lung adenocarcinoma.

## Abstract

This study aims to utilize multi-omics high-throughput sequencing data, including ATAC-seq and RNA-seq data from TCGA, GTEx, and GEO databases, to construct predictive and prognostic models for lung adenocarcinoma (LUAD) and identify potential biomarkers. We first obtained LUAD ATAC-seq data from TCGA and identified differential chromatin regions and genes through functional analysis. Differential peaks (DPs) potentially influencing LUAD progression were determined by analyzing patients at different stages, and these DPs were annotated to the genome to obtain differential peak genes (DPGs). We then integrated RNA-seq data from GTEx and TCGA to identify differentially expressed genes (DEGs) at the mRNA level, and by intersecting DEGs with DPGs, we identified 337 consensus genes (CGs). Using random forest and LASSO algorithms, we screened the CGs and constructed a predictive model comprising nine predictive-related genes (Pre-RGs), which was validated with an external dataset (GSE140343). Additionally, through Kaplan-Meier and Cox analyses combined with LASSO, five prognostic-related genes (Pro-RGs) were identified and used to establish a prognostic Cox proportional hazards model, also validated by GSE140343. Single-cell dataset analysis examined the expression of Pre-RGs and Pro-RGs across immune cell types, and further meta-analysis in the LCE database verified their expression differences and prognostic significance. Furthermore, we sequenced cell-free RNAs (cfRNAs) from 50 plasma samples (25 early-stage lung cancer and 25 benign pulmonary disease cases) to validate early cancer detection. Overall, we identified signatures including S100A8, GPM6A, FEZ1, OTX1, DNAH14, XDH, XPR1, SLC39A11, OCIAD2, TNS4, RHOV, YWHAZ, CLEC12A, and CASZ1, which show potential as drug targets and biomarkers for predicting LUAD development, prognosis, and early detection.

The online version contains supplementary material available at 10.1186/s12885-025-14943-x.

## Linked entities

- **Genes:** S100A8 (S100 calcium binding protein A8) [NCBI Gene 6279], GPM6A (glycoprotein M6A) [NCBI Gene 2823], FEZ1 (fasciculation and elongation protein zeta 1) [NCBI Gene 9638], OTX1 (orthodenticle homeobox 1) [NCBI Gene 5013], DNAH14 (dynein axonemal heavy chain 14) [NCBI Gene 127602], XDH (xanthine dehydrogenase) [NCBI Gene 7498], XPR1 (xenotropic and polytropic retrovirus receptor 1) [NCBI Gene 9213], SLC39A11 (solute carrier family 39 member 11) [NCBI Gene 201266], OCIAD2 (OCIA domain containing 2) [NCBI Gene 132299], TNS4 (tensin 4) [NCBI Gene 84951], RHOV (ras homolog family member V) [NCBI Gene 171177], YWHAZ (tyrosine 3-monooxygenase/tryptophan 5-monooxygenase activation protein zeta) [NCBI Gene 7534], CLEC12A (C-type lectin domain family 12 member A) [NCBI Gene 160364], CASZ1 (castor zinc finger 1) [NCBI Gene 54897]
- **Diseases:** lung adenocarcinoma (MONDO:0005061)

## Full-text entities

- **Diseases:** tumorigenesis (MESH:D063646), lung adenocarcinoma (MESH:D000077192)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12604227/full.md

## Figures

10 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12604227/full.md

## References

13 references — full list in the complete paper: https://tomesphere.com/paper/PMC12604227/full.md

---
Source: https://tomesphere.com/paper/PMC12604227