# CellNavi predicts genes directing cellular transitions by learning a gene graph-enhanced cell state manifold

**Authors:** Tianze Wang, Yan Pan, Fusong Ju, Shuxin Zheng, Chang Liu, Yaosen Min, Qun Jiang, Xinwei Liu, Huanhuan Xia, Guoqing Liu, Haiguang Liu, Pan Deng

PMC · DOI: 10.1038/s41556-025-01755-1 · Nature Cell Biology · 2025-10-03

## TL;DR

CellNavi uses deep learning to identify key genes that drive cell state changes, helping understand disease and drug responses.

## Contribution

CellNavi introduces a gene graph-enhanced cell state manifold for accurate driver gene prediction across diverse conditions.

## Key findings

- CellNavi accurately predicts driver genes for transitions caused by genetic, chemical, and cytokine perturbations.
- The framework works across diverse cell types, conditions, and studies, including differentiation and disease progression.
- CellNavi improves driver gene prediction and cell state manipulation for disease biology and therapeutic discovery.

## Abstract

A select few genes act as pivotal drivers in the process of cell state transitions. However, finding key genes involved in different transitions is challenging. Here, to address this problem, we present CellNavi, a deep learning-based framework designed to predict genes that drive cell state transitions. CellNavi builds a driver gene predictor upon a cell state manifold, which captures the intrinsic features of cells by learning from large-scale, high-dimensional transcriptomics data and integrating gene graphs with directional connections. Our analysis shows that CellNavi can accurately predict driver genes for transitions induced by genetic, chemical and cytokine perturbations across diverse cell types, conditions and studies. By leveraging a biologically meaningful cell state manifold, it is proficient in tasks involving critical transitions such as cellular differentiation, disease progression and drug response. CellNavi represents a substantial advancement in driver gene prediction and cell state manipulation, opening new avenues in disease biology and therapeutic discovery.

The authors integrate single-cell transcriptomic data with prior gene graphs to produce a biologically meaningful cell state manifold that can predict driver genes for genetic perturbations and differentiation events across diverse cell types.

## Full-text entities

- **Genes:** TLCD3B (TLC domain containing 3B) [NCBI Gene 83723] {aka CORD22, FAM57B, FP1188}, MRPS31 (mitochondrial ribosomal protein S31) [NCBI Gene 10240] {aka IMOGN38, MRP-S31, S31mt, mS31}, NDUFS8 (NADH:ubiquinone oxidoreductase core subunit S8) [NCBI Gene 4728] {aka CI-23k, CI23KD, MC1DN2, TYKY}, HDAC9 (histone deacetylase 9) [NCBI Gene 9734] {aka HD7, HD7b, HD9, HDAC, HDAC7B, HDAC9B}, IL21 (interleukin 21) [NCBI Gene 59067] {aka CVID11, IL-21, Za11}, CD27 (CD27 molecule) [NCBI Gene 939] {aka S152, S152. LPFS2, T14, TNFRSF7, Tp55}, IL2 (interleukin 2) [NCBI Gene 3558] {aka IL-2, TCGF, lymphokine}, APEX1 (apurinic/apyrimidinic endodeoxyribonuclease 1) [NCBI Gene 328] {aka APE, APE1, APEN, APEX, APX, HAP1}, NDUFB7 (NADH:ubiquinone oxidoreductase subunit B7) [NCBI Gene 4713] {aka B18, CI-B18, MC1DN39}, CELF1 (CUGBP Elav-like family member 1) [NCBI Gene 10658] {aka BRUNOL2, CUG-BP, CUGBP, CUGBP1, EDEN-BP, NAB50}, RPRD1B (regulation of nuclear pre-mRNA domain containing 1B) [NCBI Gene 58490] {aka C20orf77, CREPT, K-H, Kub5-Hera, NET60, dJ1057B20.2}, CD28 (CD28 molecule) [NCBI Gene 940] {aka IMD123, Tp44}, CCL4 (C-C motif chemokine ligand 4) [NCBI Gene 6351] {aka ACT2, AT744.1, G-26, HC21, LAG-1, LAG1}, IL9R (interleukin 9 receptor) [NCBI Gene 3581] {aka CD129, IL-9R}, POLR2A (RNA polymerase II subunit A) [NCBI Gene 5430] {aka NEDHIB, POLR2, POLRA, RPB1, RPBh1, RPO2}, GRN (granulin precursor) [NCBI Gene 2896] {aka CLN11, FTD2, GEP, GP88, PCDGF, PEPI}, RPRD1A (regulation of nuclear pre-mRNA domain containing 1A) [NCBI Gene 55197] {aka HsT3101, P15RS}, FARP1 (FERM, ARH/RhoGEF and pleckstrin domain protein 1) [NCBI Gene 10160] {aka CDEP, FARP1-IT1, GLCC1, PLEKHC2, PPP1R75}, GATA3 (GATA binding protein 3) [NCBI Gene 2625] {aka HDR, HDRS}, VAV1 (vav guanine nucleotide exchange factor 1) [NCBI Gene 7409] {aka VAV}, HYOU1 (hypoxia up-regulated 1) [NCBI Gene 10525] {aka GRP-170, Grp170, HSP12A, IMD59, ORP-150, ORP150}, SPINT2 (serine peptidase inhibitor, Kunitz type 2) [NCBI Gene 10653] {aka DIAR3, HAI-2, HAI2, Kop, PB}, HSPA5 (heat shock protein family A (Hsp70) member 5) [NCBI Gene 3309] {aka BIP, GRP78, HEL-S-89n}, CPSF6 (cleavage and polyadenylation specific factor 6) [NCBI Gene 11052] {aka CFIM, CFIM68, CFIM72, HPBRII-4, HPBRII-7}, TAGAP (T cell activation RhoGTPase activating protein) [NCBI Gene 117289] {aka ARHGAP47, FKSG15, IDDM21, TAGAP1}, PRPF3 (pre-mRNA processing factor 3) [NCBI Gene 9129] {aka HPRP3, HPRP3P, PRP3, Prp3p, RP18, SNRNP90}, HDAC2 (histone deacetylase 2) [NCBI Gene 3066] {aka HD2, KDAC2, RPD3, YAF1}, MRPS26 (mitochondrial ribosomal protein S26) [NCBI Gene 64949] {aka C20orf193, GI008, MRP-S13, MRP-S26, MRPS13, NY-BR-87}, NUDT21 (nudix hydrolase 21) [NCBI Gene 11051] {aka CFIM25, CPSF5}, F3 (coagulation factor III, tissue factor) [NCBI Gene 2152] {aka CD142, TF, TFA}, CCL3L3 (C-C motif chemokine ligand 3 like 3) [NCBI Gene 414062] {aka 464.2, D17S1718, G0S19-2, LD78, LD78BETA, SCYA3L}, G0S2 (G0/G1 switch 2) [NCBI Gene 50486], CD4 (CD4 molecule) [NCBI Gene 920] {aka CD4mut, IMD79, Leu-3, OKT4D, T4}, EIF2S1 (eukaryotic translation initiation factor 2 subunit alpha) [NCBI Gene 1965] {aka EIF-2, EIF-2A, EIF-2alpha, EIF2, EIF2A}, BAX (BCL2 associated X, apoptosis regulator) [NCBI Gene 581] {aka BCL2L4}, VCP (valosin containing protein) [NCBI Gene 7415] {aka CDC48, FTDALS6, TERA, p97}, IFNG (interferon gamma) [NCBI Gene 3458] {aka IFG, IFI, IMD69}, LIMA1 (LIM domain and actin binding 1) [NCBI Gene 51474] {aka EPLIN, LDLCQ8, SREBP3}, DCP2 (decapping mRNA 2) [NCBI Gene 167227] {aka NUDT20}, CYCS (cytochrome c, somatic) [NCBI Gene 54205] {aka CYC, HCS, THC4}, CCL3 (C-C motif chemokine ligand 3) [NCBI Gene 6348] {aka G0S19-1, LD78, LD78ALPHA, MIP-1-alpha, MIP1A, SCI}
- **Diseases:** cancer (MESH:D009369), GRNs (MESH:C537680), LISI (MESH:C537340), neurodegenerative disease (MESH:D019636), PD (MESH:D010300), chronic myelogenous leukaemia (MESH:D015464)
- **Chemicals:** Ac-CoA (-), dopamine (MESH:D004298), aniline (MESH:C023650), zinc (MESH:D015032), entinostat (MESH:C118739), panobinostat (MESH:D000077767), Tunicamycin (MESH:D014415), acetyl-CoA (MESH:D000105), acetate (MESH:D000085), tacedinaline (MESH:C081895), mocetinostat (MESH:C523184)
- **Species:** Homo sapiens (human, species) [taxon 9606], Mus musculus (house mouse, species) [taxon 10090]
- **Cell lines:** K62 — Homo sapiens (Human), Induced pluripotent stem cell (CVCL_C0GN), K562 — Homo sapiens (Human), Blast phase chronic myelogenous leukemia, BCR-ABL1 positive, Cancer cell line (CVCL_0004), HEK293FT — Homo sapiens (Human), Transformed cell line (CVCL_6911)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12527915/full.md

## Figures

16 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12527915/full.md

## References

17 references — full list in the complete paper: https://tomesphere.com/paper/PMC12527915/full.md

---
Source: https://tomesphere.com/paper/PMC12527915