# Integration of single cell and bulk transcriptomic analyses identifies FAM189A2 as a key prognostic gene in lung cancer

**Authors:** Peng Guo, Mengqi Xiang, Jing Chen, Huachuan Zhang

PMC · DOI: 10.3389/fimmu.2025.1701806 · Frontiers in Immunology · 2026-01-06

## TL;DR

This study identifies FAM189A2 as a key gene linked to better survival in lung cancer by combining single-cell and bulk RNA data, offering a new biomarker and risk model.

## Contribution

The integration of single-cell and bulk transcriptomic data reveals FAM189A2 as a novel prognostic gene and an eight-gene risk model for lung cancer.

## Key findings

- FAM189A2 downregulation in tumors correlates with worse survival, while its overexpression reduces cancer cell migration and invasion.
- An eight-gene LASSO-Cox model stratifies lung cancer patients into high- and low-risk groups across multiple cohorts.
- High-risk tumors show immune-enriched microenvironments, while low-risk tumors are more sensitive to Aurora kinase and mTOR inhibition.

## Abstract

Lung cancer remains the leading cause of cancer−related mortality. Single−cell RNA sequencing (scRNA−seq) enables high−resolution mapping of tumor microenvironment, but how malignant cell states connect to patient outcomes and therapeutic vulnerabilities is not fully understood.

We analyzed scRNA−seq profiles from 13 primary lung tumors and matched normal lung tissues (48,149 cells). Malignant versus non−malignant epithelial cells were defined by inferCNV, co−expression programs resolved with Hotspot, transcription factor regulons inferred by pySCENIC, and differentiation potential estimated by CytoTRACE and diffusion pseudotime. Key malignant programs were cross−checked in an independent LUAD scRNA−seq cohort (GSE131907). Program signatures were projected into TCGA−LUAD to build an eight−gene LASSO−Cox risk model, which was further validated in two external NSCLC/LUAD cohorts (GSE30219 and GSE31210). Immune contexture was inferred by CIBERSORT and TIDE, drug sensitivities predicted with pRRophetic, and the function of FAM189A2 tested by knockdown and overexpression in A549 and NCI−H23 cells.

We delineated 12 major cell populations and six malignant epithelial subclusters, including an early GPRC5A+ subpopulation with high developmental potential. Malignant programs corresponding to cell−cycle and inflammatory states (Modules 8, 14 and 16) were associated with worse overall survival, whereas Module 15, preferentially active in the GPRC5A+ state, was linked to better outcomes. An eight−gene signature (LAMA5, ACTB, B4GALT1, KLF5, KRT18, FAM189A2, SLC34A2, S100A11) robustly stratified patients into high− and low−risk groups across TCGA−LUAD and two validation cohorts. High−risk tumors displayed an immune−enriched yet matrix−restrained microenvironment with higher CD8+ T cells, whereas low−risk tumors showed greater predicted sensitivity to Aurora kinase, IGF−1R, mTOR and TGF−β inhibition. FAM189A2 was down−regulated in tumors, higher expression predicted better survival, and bidirectional perturbation in A549 and H23 cells demonstrated that loss of FAM189A2 enhanced, while overexpression attenuated, migration and invasion in vitro.

Integrating single−cell and bulk transcriptomes links malignant epithelial state programs to prognosis, yields a practical eight−gene risk model validated in multiple LUAD cohorts, and nominates FAM189A2 as a putative tumor suppressor and potential biomarker in lung cancer. These findings suggest testable strategies for risk stratification and therapy selection that warrant prospective evaluation.

## Linked entities

- **Genes:** ENTREP1 (endosomal transmembrane epsin interactor 1) [NCBI Gene 9413], LAMA5 (laminin subunit alpha 5) [NCBI Gene 3911], ACTB (actin beta) [NCBI Gene 60], B4GALT1 (beta-1,4-galactosyltransferase 1) [NCBI Gene 2683], KLF5 (KLF transcription factor 5) [NCBI Gene 688], KRT18 (keratin 18) [NCBI Gene 3875], SLC34A2 (solute carrier family 34 member 2) [NCBI Gene 10568], S100A11 (S100 calcium binding protein A11) [NCBI Gene 6282], GPRC5A (G protein-coupled receptor class C group 5 member A) [NCBI Gene 9052]
- **Diseases:** lung cancer (MONDO:0005138)

## Full-text entities

- **Genes:** TGFB1 (transforming growth factor beta 1) [NCBI Gene 7040] {aka CAEND1, CED, DPD1, IBDIMDE, LAP, TGF-beta1}, LAMA5 (laminin subunit alpha 5) [NCBI Gene 3911] {aka BBDS2, NPHS26}, KRT18 (keratin 18) [NCBI Gene 3875] {aka CK-18, CYK18, K18}, SLC34A2 (solute carrier family 34 member 2) [NCBI Gene 10568] {aka NAPI-3B, NAPI-IIb, NPTIIb, NaPi2b, PULAM}, ACTB (actin beta) [NCBI Gene 60] {aka BKRNS, BNS, BRWS1, CSMH, DDS1, PS1TP5BP1}, GPRC5A (G protein-coupled receptor class C group 5 member A) [NCBI Gene 9052] {aka GPCR5A, PEIG-1, RAI3, RAIG1, TIG1}, B4GALT1 (beta-1,4-galactosyltransferase 1) [NCBI Gene 2683] {aka B4GAL-T1, CDG2D, CLDLFIB, GGTB2, GT1, GTB}, S100A11 (S100 calcium binding protein A11) [NCBI Gene 6282] {aka HEL-S-43, MLN70, S100C}, KLF5 (KLF transcription factor 5) [NCBI Gene 688] {aka BTEB2, CKLF, IKLF}, CD8A (CD8 subunit alpha) [NCBI Gene 925] {aka CD8, CD8alpha, IMD116, Leu2, p32}, MTOR (mechanistic target of rapamycin kinase) [NCBI Gene 2475] {aka FRAP, FRAP1, FRAP2, RAFT1, RAPT1, SKS}, IGF1R (insulin like growth factor 1 receptor) [NCBI Gene 3480] {aka CD221, IGFIR, IGFR, JTK13}, ENTREP1 (endosomal transmembrane epsin interactor 1) [NCBI Gene 9413] {aka C9orf61, ENTREP, FAM189A2, X123}
- **Diseases:** Lung cancer (MESH:D008175), cancer (MESH:D009369), inflammatory (MESH:D007249)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12816179/full.md

## Figures

10 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12816179/full.md

## References

28 references — full list in the complete paper: https://tomesphere.com/paper/PMC12816179/full.md

---
Source: https://tomesphere.com/paper/PMC12816179