# Single-Cell Analysis Reveals Epithelial Heterogeneity and Tumor Microenvironment Characteristics During the Malignant Progression of Colorectal Cancer

**Authors:** Qianqian Chen, Yaoqian Yuan, Shuai Tian, Jiayan Zhou, Kunming Lv, Enqiang Linghu

PMC · DOI: 10.3390/biomedicines14020371 · Biomedicines · 2026-02-05

## TL;DR

This study uses single-cell sequencing to explore epithelial cell diversity and tumor microenvironment changes in colorectal cancer progression.

## Contribution

Identifies 11 epithelial cell subtypes and their roles in CRC progression, highlighting KCNMA1+ and MKI67+ cells as key indicators.

## Key findings

- Epithelial cells were divided into 11 subgroups marked by genes like MKI67 and KCNMA1.
- Downregulation of KCNMA1 and upregulation of MKI67 correlate with poor CRC prognosis.
- Cell–cell communication suggests bidirectional regulation between epithelial and fibroblast subsets.

## Abstract

Background/Objectives: To mine single-cell sequencing data for colorectal cancer (CRC), identify CRC epithelial cell subtypes, and explore the heterogeneity of epithelial cells and their impact on the tumor microenvironment (TME). Methods: The GSE201348 dataset, including normal, colorectal adenoma, high-grade colorectal intraepithelial neoplasia, and CRC tumor tissue samples, was downloaded from the Gene Expression Omnibus. The Seurat package of R software was used for data quality control, data integration, normalization, and clustering. The Feature Plot and the Recode function were executed to annotate and group the epithelial cells. Finally, genetic differences, copy number variant heterogeneity, pseudotime, cell–cell communication, and Gene Set Variation Analysis (GSVA) were further conducted. Results: In total, 26,335 gene matrices from 263,872 cells were obtained for subsequent analyses. Four cell clusters, including immune cells, fibroblasts, endothelial cells, and epithelial cells, were identified. Epithelial cells were further divided into 11 subgroups characterized by MKI67, SLC27A6, PLCE1, NKD1, KCNMA1, GDA, CLCA4, BEST4, LRMP, ACTG2, and ASPM. GSVA enrichment analysis suggested a role of the “P53 pathway,” “Wnt–β-catenin signaling,” and “MYC targets V1” pathways in epithelial cells during the malignant progression of tumors. Survival analysis indicated that downregulation of KCNMA1 and upregulation of MKI67 were associated with poor prognosis. Cell–cell communication analysis suggested a bidirectional regulatory role between epithelial and fibroblast subsets. Conclusions: This study analyzed the gene expression characteristics of 11 types of epithelial cells during the malignant progression of CRC. KCNMA1+ and MKI67+ epithelial subpopulations are important indicators for the malignant progression of CRC.

## Linked entities

- **Genes:** MKI67 (marker of proliferation Ki-67) [NCBI Gene 4288], SLC27A6 (solute carrier family 27 member 6) [NCBI Gene 28965], PLCE1 (phospholipase C epsilon 1) [NCBI Gene 51196], NKD1 (NKD inhibitor of Wnt signaling pathway 1) [NCBI Gene 85407], KCNMA1 (potassium calcium-activated channel subfamily M alpha 1) [NCBI Gene 3778], GDA (guanine deaminase) [NCBI Gene 9615], CLCA4 (chloride channel accessory 4) [NCBI Gene 22802], BEST4 (bestrophin 4) [NCBI Gene 266675], IRAG2 (inositol 1,4,5-triphosphate receptor associated 2) [NCBI Gene 4033], ACTG2 (actin gamma 2, smooth muscle) [NCBI Gene 72], ASPM (assembly factor for spindle microtubules) [NCBI Gene 259266], TP53 (tumor protein p53) [NCBI Gene 7157], MYC (MYC proto-oncogene, bHLH transcription factor) [NCBI Gene 4609]
- **Diseases:** colorectal cancer (MONDO:0005575), CRC (MONDO:0005575)

## Full-text entities

- **Genes:** IRAG2 (inositol 1,4,5-triphosphate receptor associated 2) [NCBI Gene 4033] {aka JAW1, LRMP}, LINC00486 (long intergenic non-protein coding RNA 486) [NCBI Gene 285045], PDGFRA (platelet derived growth factor receptor alpha) [NCBI Gene 5156] {aka CD140A, PDGFR-2, PDGFR2}, IFNA1 (interferon alpha 1) [NCBI Gene 3439] {aka IFL, IFN, IFN-ALPHA, IFN-alphaD, IFNA13, IFNA@}, GDA (guanine deaminase) [NCBI Gene 9615] {aka CYPIN, GAH, GUANASE, NEDASIN}, EPCAM (epithelial cell adhesion molecule) [NCBI Gene 4072] {aka Ber-Ep4, BerEp4, DIAR5, EGP-2, EGP314, EGP40}, BRAF (B-Raf proto-oncogene, serine/threonine kinase) [NCBI Gene 673] {aka B-RAF1, B-raf, BRAF-1, BRAF1, NS7, RAFB1}, HSP90AA1 (heat shock protein 90 alpha family class A member 1) [NCBI Gene 3320] {aka EL52, HEL-S-65p, HSP86, HSP89A, HSP90A, HSP90N}, Pecam1 (platelet/endothelial cell adhesion molecule 1) [NCBI Gene 18613] {aka Cd31, PECAM-1, Pecam}, MIR17 (microRNA 17) [NCBI Gene 406952] {aka MIR17-5p, MIR91, MIRN17, MIRN91, hsa-mir-17, miR-17}, TP53 (tumor protein p53) [NCBI Gene 7157] {aka BCC7, BMFS5, LFS1, P53, TRP53}, Col1a1 (collagen, type I, alpha 1) [NCBI Gene 12842] {aka Col1a-1, Cola-1, Cola1, Mov-13, Mov13}, KAT2B (lysine acetyltransferase 2B) [NCBI Gene 8850] {aka CAF, P/CAF, PCAF}, DES (desmin) [NCBI Gene 1674] {aka CDCD3, CSM1, CSM2, LGMD1D, LGMD1E, LGMD2R}, P2RY14 (purinergic receptor P2Y14) [NCBI Gene 9934] {aka BPR105, GPR105, P2Y14}, ATF7IP (activating transcription factor 7 interacting protein) [NCBI Gene 55729] {aka AM, ATF-IP, ATF7IP1, MCAF, MCAF1, p621}, FAP (fibroblast activation protein alpha) [NCBI Gene 2191] {aka DPPIV, FAPA, FAPalpha, SIMP}, PLCE1 (phospholipase C epsilon 1) [NCBI Gene 51196] {aka NPHS3, PLCE, PPLC}, GPX2 (glutathione peroxidase 2) [NCBI Gene 2877] {aka GI-GPx, GPRP, GPRP-2, GPx-2, GPx-GI, GSHPX-GI}, DPP6 (dipeptidyl peptidase like 6) [NCBI Gene 1804] {aka DPL1, DPPX, MRD33, VF2}, DDX5 (DEAD-box helicase 5) [NCBI Gene 1655] {aka G17P1, HLR1, HUMP68, p68}, ATF7IP2 (activating transcription factor 7 interacting protein 2) [NCBI Gene 80063] {aka MCAF2}, NFKB1 (nuclear factor kappa B subunit 1) [NCBI Gene 4790] {aka CVID12, EBP-1, KBF1, NF-kB, NF-kB1, NF-kappa-B1}, Epcam (epithelial cell adhesion molecule) [NCBI Gene 17075] {aka CD326, EGP, EGP-2, Egp314, Ep-CAM, EpCAM1}, ADAM28 (ADAM metallopeptidase domain 28) [NCBI Gene 10863] {aka ADAM 28, MDC-L, MDCL, eMDC II, eMDCII}, KCNMA1 (potassium calcium-activated channel subfamily M alpha 1) [NCBI Gene 3778] {aka BKTM, CADEDS, IEG16, KCa1.1, LIWAS, MaxiK}, Ptprc (protein tyrosine phosphatase receptor type C) [NCBI Gene 19264] {aka B220, CD45R, Cd45, L-CA, Ly-5, Lyt-4}, EPHA7 (EPH receptor A7) [NCBI Gene 2045] {aka EHK-3, EHK3, EK11, HEK11}, KRAS (KRAS proto-oncogene, GTPase) [NCBI Gene 3845] {aka 'C-K-RAS, C-K-RAS, CFC2, K-RAS2A, K-RAS2B, K-RAS4A}, Acta2 (actin alpha 2, smooth muscle, aorta) [NCBI Gene 11475] {aka 0610041G09Rik, Actvs, SMAalpha, SMalphaA, a-SMA, alphaSMA}, CTNNB1 (catenin beta 1) [NCBI Gene 1499] {aka CTNNB, EVR7, MRD19, NEDSDV, armadillo}, LAMA2 (laminin subunit alpha 2) [NCBI Gene 3908] {aka LAMM, MDC1A}, WASHC1 (WASH complex subunit 1) [NCBI Gene 100287171] {aka FAM39E, WASH, WASH1}, NGF (nerve growth factor) [NCBI Gene 4803] {aka Beta-NGF, HSAN5, NGFB}, ACTG2 (actin gamma 2, smooth muscle) [NCBI Gene 72] {aka ACT, ACTA3, ACTE, ACTL3, ACTSG, MMIHS5}, NEAT1 (nuclear paraspeckle assembly transcript 1) [NCBI Gene 283131] {aka LINC00084, NCRNA00084, TP53LC15, TncRNA, VINC}, CLCA4 (chloride channel accessory 4) [NCBI Gene 22802] {aka CaCC, CaCC2}, BEST4 (bestrophin 4) [NCBI Gene 266675] {aka VMD2L2}, ASPM (assembly factor for spindle microtubules) [NCBI Gene 259266] {aka ASP, Calmbp1, MCPH5}, TGFB1 (transforming growth factor beta 1) [NCBI Gene 7040] {aka CAEND1, CED, DPD1, IBDIMDE, LAP, TGF-beta1}, LINC00511 (long intergenic non-protein coding RNA 511) [NCBI Gene 400619] {aka LCAL5, onco-lncRNA-12}, Cd19 (CD19 antigen) [NCBI Gene 12478], RYR2 (ryanodine receptor 2) [NCBI Gene 6262] {aka ARVC2, ARVD2, RYR-2, RyR, VACRDS, VTSIP}, SOX6 (SRY-box transcription factor 6) [NCBI Gene 55553] {aka HSSOX6, SOXD, TOLCAS}, TNF (tumor necrosis factor) [NCBI Gene 7124] {aka DIF, IMD127, TNF-alpha, TNFA, TNFSF2, TNLG1F}, NKD1 (NKD inhibitor of Wnt signaling pathway 1) [NCBI Gene 85407] {aka Naked1}, SLC27A6 (solute carrier family 27 member 6) [NCBI Gene 28965] {aka ACSVL2, FACVL2, FATP6, VLCS-H1}, MKI67 (marker of proliferation Ki-67) [NCBI Gene 4288] {aka KIA, MIB-, MIB-1, PPP1R105}, UBE2K (ubiquitin conjugating enzyme E2 K) [NCBI Gene 3093] {aka E2-25K, HIP2, HYPG, LIG, UBC1}, PTK2 (protein tyrosine kinase 2) [NCBI Gene 5747] {aka FADK, FADK 1, FAK, FAK1, FRNK, PPP1R71}, MYC (MYC proto-oncogene, bHLH transcription factor) [NCBI Gene 4609] {aka MRTL, MYCC, bHLHe39, c-Myc}, MUC2 (mucin 2, oligomeric mucus/gel-forming) [NCBI Gene 4583] {aka MLP, MUC-2, SMUC}, NTRK1 (neurotrophic receptor tyrosine kinase 1) [NCBI Gene 4914] {aka MTC, TRK, TRK1, TRKA, Trk-A, p140-TrkA}
- **Diseases:** natural killer T-cell lymphoma (MESH:D000077428), lung squamous cell carcinoma (MESH:D002294), Hypoxia (MESH:D000860), carcinomatous (MESH:D055756), UMAP (MESH:C567162), liver hepatocellular carcinoma (MESH:D006528), CNV (OMIM:610141), Adenoma (MESH:D000236), tumorigenesis (MESH:D063646), gastric cancer (MESH:D013274), breast invasive carcinoma (MESH:D001943), lung adenocarcinoma (MESH:D000077192), colorectal intraepithelial neoplasia (MESH:D002578), Cancer (MESH:D009369), adenoma-carcinoma (MESH:D000230), precancerous (MESH:D011230), CRC (MESH:D015179), injury to (MESH:D014947), osteosarcoma (MESH:D012516)
- **Chemicals:** flavonoid (MESH:D005419), flavone (MESH:C043562), FP236383.1 (-)
- **Species:** Homo sapiens (human, species) [taxon 9606]
- **Mutations:** serine/glycine
- **Cell lines:** fibroblasts — Mus musculus (Mouse), Spontaneously immortalized cell line (CVCL_0594), S2 — Drosophila melanogaster (Fruit fly), Spontaneously immortalized cell line (CVCL_Z232)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12938631/full.md

## Figures

6 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12938631/full.md

## References

40 references — full list in the complete paper: https://tomesphere.com/paper/PMC12938631/full.md

---
Source: https://tomesphere.com/paper/PMC12938631