# Extracellular Matrix–Associated Biomarkers for Hepatocellular Carcinoma: Insights From Machine Learning and Single‐Cell Analysis

**Authors:** Pedram Asadi Sarabi, Elham Rismani, Amir Ali Judaki, Amirhossein Farrokhzad, Zahra Hendi, Moustapha Hassan, Massoud Vosough

PMC · DOI: 10.1155/ijog/6654142 · International Journal of Genomics · 2026-02-24

## TL;DR

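Using TCGA data, machine learning, and single-cell analysis, researchers identified eight extracellular matrix (ECM)-associated genes as candidate diagnostic biomarkers for hepatocellular carcinoma (HCC); three of them (MAPT, PLXDC1, and STC2) were also associated with poor overall survival.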

## Contribution

The study proposes eight ECM-associated genes as diagnostic biomarkers for hepatocellular carcinoma and identifies three of them (MAPT, PLXDC1, and STC2) as a prognostic subset.

## Key findings

- Eight ECM-associated genes (CSPG4, CD34, C1orf35, ESM1, MAPT, PLXDC1, STC2, THBS4) were identified as upregulated diagnostic biomarkers.
- MAPT, PLXDC1, and STC2 were linked to poor overall survival in HCC patients.
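- Single-cell RNA sequencing associated the prognostic genes with distinct cell types (MAPT in T cells, PLXDC1 in cancer-associated fibroblasts, and STC2, mildly, in tumor-associated macrophages and endothelial cells), suggesting roles in tumor microenvironment dynamics.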
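
A survival association like the one reported for MAPT, PLXDC1, and STC2 is commonly assessed by splitting patients into high- and low-expression groups and comparing them with a log-rank test. This summary does not state which test or cutoff the authors used, so the following is only a minimal sketch under those assumptions: the median split, the function name `logrank`, and all patient values are hypothetical and purely illustrative.

```python
import math
from statistics import median

def logrank(high, low):
    """Two-group log-rank test.

    Each group is a list of (time, event) pairs, where event is 1 for an
    observed death and 0 for a censored observation.
    Returns (chi-square statistic, p-value with 1 degree of freedom).
    """
    event_times = sorted({t for t, e in high + low if e == 1})
    observed = expected = variance = 0.0
    for t in event_times:
        # Patients still at risk just before time t in each group
        n1 = sum(1 for ti, _ in high if ti >= t)
        n2 = sum(1 for ti, _ in low if ti >= t)
        # Deaths occurring exactly at time t
        d1 = sum(1 for ti, e in high if ti == t and e == 1)
        d2 = sum(1 for ti, e in low if ti == t and e == 1)
        n, d = n1 + n2, d1 + d2
        observed += d1
        expected += d * n1 / n
        if n > 1:
            variance += d * (n1 / n) * (n2 / n) * (n - d) / (n - 1)
    chi2 = (observed - expected) ** 2 / variance
    # Survival function of a chi-square distribution with 1 df
    p_value = math.erfc(math.sqrt(chi2 / 2))
    return chi2, p_value

# Hypothetical patients: (gene expression, follow-up in months, event flag)
patients = [
    (9.2, 5, 1), (8.8, 8, 1), (8.5, 12, 1), (8.1, 14, 1), (7.9, 20, 0),
    (6.0, 18, 1), (5.7, 30, 0), (5.5, 36, 1), (5.1, 40, 0), (4.8, 48, 0),
]

# Assumed cutoff: median expression; strictly above the median counts as "high"
cutoff = median(expr for expr, _, _ in patients)
high_group = [(t, e) for expr, t, e in patients if expr > cutoff]
low_group = [(t, e) for expr, t, e in patients if expr <= cutoff]

chi2, p_value = logrank(high_group, low_group)
print(f"cutoff={cutoff:.2f}  chi2={chi2:.2f}  p={p_value:.4f}")
```

On real cohorts a dedicated survival library would normally be used, and the cutoff choice (median, quartiles, or an optimized threshold) can change the result substantially.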

## Abstract

The 5‐year overall survival rate for hepatocellular carcinoma (HCC) patients remains below 20%. Alterations in the extracellular matrix (ECM) are increasingly recognized as central drivers of HCC initiation and progression. This study applied a systems biology framework integrating omics data and machine learning to analyze gene expression and regulatory networks in HCC using The Cancer Genome Atlas. Eight ECM‐associated genes (CSPG4, CD34, C1orf35, ESM1, MAPT, PLXDC1, STC2, and THBS4) were identified as upregulated diagnostic biomarkers with strong discriminatory power. Among them, MAPT, PLXDC1, and STC2 showed significant associations with poor overall survival, defining a prognostic subset. Validation in the GSE104310 and GSE144269 datasets confirmed consistent expression patterns across cohorts. Functional enrichment linked these genes to tissue remodeling and angiogenesis. Single‐cell RNA sequencing revealed MAPT upregulation in T cells, PLXDC1 enrichment in cancer‐associated fibroblasts, and mild STC2 elevation in tumor‐associated macrophages and endothelial cells. These findings identify key ECM‐based biomarkers with potential for early detection, prognosis, and therapeutic targeting in HCC.
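
The "discriminatory power" of a single-gene diagnostic biomarker is often summarized as the area under the ROC curve (AUC), which equals the probability that a randomly chosen tumor sample shows higher expression than a randomly chosen normal sample. The abstract does not name the metric the authors used, so the sketch below is an assumption-laden illustration: the function name `roc_auc` and the expression values are hypothetical and are not taken from the paper.

```python
def roc_auc(tumor, normal):
    """AUC for an upregulated marker via pairwise comparison.

    Counts the fraction of (tumor, normal) pairs in which the tumor sample
    has the higher expression value; ties contribute 0.5.
    """
    wins = 0.0
    for t in tumor:
        for n in normal:
            if t > n:
                wins += 1.0
            elif t == n:
                wins += 0.5
    return wins / (len(tumor) * len(normal))

# Hypothetical log2 expression values for one gene
tumor_expr = [7.9, 8.4, 6.8, 9.1, 8.0]
normal_expr = [6.1, 6.9, 5.8, 7.0, 6.5]

auc = roc_auc(tumor_expr, normal_expr)
print(f"AUC = {auc:.2f}")
```

An AUC of 0.5 corresponds to no separation between groups and 1.0 to perfect separation; the pairwise form shown here is quadratic in sample size and is meant for clarity rather than efficiency.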

## Linked entities

- **Genes:** CSPG4 (chondroitin sulfate proteoglycan 4) [NCBI Gene 1464], CD34 (CD34 molecule) [NCBI Gene 947], C1orf35 (chromosome 1 open reading frame 35) [NCBI Gene 79169], ESM1 (endothelial cell specific molecule 1) [NCBI Gene 11082], MAPT (microtubule associated protein tau) [NCBI Gene 4137], PLXDC1 (plexin domain containing 1) [NCBI Gene 57125], STC2 (stanniocalcin 2) [NCBI Gene 8614], THBS4 (thrombospondin 4) [NCBI Gene 7060]
- **Diseases:** hepatocellular carcinoma (MONDO:0007256)

## Full-text entities

- **Genes:** AKT1 (AKT serine/threonine kinase 1) [NCBI Gene 207] {aka AKT, PKB, PKB-ALPHA, PRKBA, RAC, RAC-ALPHA}, MYC (MYC proto-oncogene, bHLH transcription factor) [NCBI Gene 4609] {aka MRTL, MYCC, bHLHe39, c-Myc}, STC2 (stanniocalcin 2) [NCBI Gene 8614] {aka STC-2, STCRP}, PLXDC1 (plexin domain containing 1) [NCBI Gene 57125] {aka TEM3, TEM7}, THBS4 (thrombospondin 4) [NCBI Gene 7060] {aka TSP-4, TSP4}, CD34 (CD34 molecule) [NCBI Gene 947], ESM1 (endothelial cell specific molecule 1) [NCBI Gene 11082] {aka endocan}, TP53 (tumor protein p53) [NCBI Gene 7157] {aka BCC7, BMFS5, LFS1, P53, TRP53}, MAPT (microtubule associated protein tau) [NCBI Gene 4137] {aka DDPAC, FTD1, FTDP-17, MAPTL, MSTD, MTBT1}, AFP (alpha fetoprotein) [NCBI Gene 174] {aka AFPD, FETA, HPAFP}, PTK2 (protein tyrosine kinase 2) [NCBI Gene 5747] {aka FADK, FADK 1, FAK, FAK1, FRNK, PPP1R71}, ITGB1 (integrin subunit beta 1) [NCBI Gene 3688] {aka CD29, FNRB, GPIIA, MDF2, MSK12, VLA-BETA}, C1orf35 (chromosome 1 open reading frame 35) [NCBI Gene 79169] {aka MMTAG2}, PIK3CB (phosphatidylinositol-4,5-bisphosphate 3-kinase catalytic subunit beta) [NCBI Gene 5291] {aka P110BETA, PI3K, PI3KBETA, PIK3C1}, MMRN1 (multimerin 1) [NCBI Gene 22915] {aka ECM, EMILIN4, GPIa*, MMRN}, CSPG4 (chondroitin sulfate proteoglycan 4) [NCBI Gene 1464] {aka CSPG4A, HMW-MAA, MCSP, MCSPG, MEL-CSPG, MSK16}
- **Diseases:** inflammation (MESH:D007249), Cancer (MESH:D009369), carcinogenesis (MESH:D063646), TAM (MESH:D020914), hypoxia (MESH:D000860), tumor endothelial marker 7 (MESH:D005600), OS (MESH:D011475), deaths (MESH:D003643), metastasis (MESH:D009362), gastrointestinal and hepatic cancers (MESH:D005770), Hepatocellular carcinoma (MESH:D006528)
- **Chemicals:** carbohydrates (MESH:D002241), calcium (MESH:D002118)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12932914/full.md

## Figures

2 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12932914/full.md

## References

53 references — full list in the complete paper: https://tomesphere.com/paper/PMC12932914/full.md

---
Source: https://tomesphere.com/paper/PMC12932914