# Transcriptomic Profiling of Gastric Cancer Reveals Key Biomarkers and Pathways via Bioinformatic Analysis

**Authors:** Ipek Balikci Cicek, Zeynep Kucukakcali

PMC · DOI: 10.3390/genes16070829 · 2025-07-16

## TL;DR

This study uses transcriptomic data and machine learning to identify key biomarkers and pathways in gastric cancer, offering potential for early detection and treatment.

## Contribution

The study integrates multiple transcriptomic datasets and machine learning to identify consistent, biologically relevant biomarkers for gastric cancer.

## Key findings

- 627 differentially expressed genes were identified in the discovery dataset, with key genes including CST1, KIAA1199, TIMP1, MSLN, and ATP4A.
- A random forest model achieved excellent classification performance with an AUC of 0.952 for gastric cancer detection.
- Cross-platform validation confirmed 55.6% concordance among core genes, highlighting CST1, TIMP1, KRT16, and ATP4A as strong diagnostic candidates.

## Abstract

Background/Objectives: Gastric cancer (GC) remains a significant global health burden due to its high mortality rate and frequent diagnosis at advanced stages. This study aimed to identify reliable diagnostic biomarkers and elucidate molecular mechanisms underlying GC by integrating transcriptomic data from independent platforms and applying machine learning techniques. Methods: Two transcriptomic datasets from the Gene Expression Omnibus were analyzed: GSE26899 (microarray, n = 108) as the discovery dataset and GSE248612 (RNA-seq, n = 12) for validation. Differential expression analysis was conducted using limma and DESeq2, selecting genes with |log2FC| > 1 and adjusted p < 0.05. The top 200 differentially expressed genes (DEGs) were used to develop machine learning models (random forest, logistic regression, neural networks). Functional enrichment analyses (GO, KEGG, Hallmark) were applied to explore relevant biological pathways. Results: In GSE26899, 627 DEGs were identified (201 upregulated, 426 downregulated), with key genes including CST1, KIAA1199, TIMP1, MSLN, and ATP4A. The random forest model demonstrated excellent classification performance (AUC = 0.952). GSE248612 validation yielded 738 DEGs. Cross-platform comparison confirmed 55.6% concordance among core genes, highlighting CST1, TIMP1, KRT17, ATP4A, CHIA, KRT16, and CRABP2. Enrichment analyses revealed involvement in ECM–receptor interaction, PI3K-Akt signaling, EMT, and cell cycle. Conclusions: This integrative transcriptomic and machine learning framework effectively identified high-confidence biomarkers for GC. Notably, CST1, TIMP1, KRT16, and ATP4A emerged as consistent, biologically relevant candidates with strong diagnostic performance and potential clinical utility. These findings may aid early detection strategies and guide future therapeutic developments in gastric cancer.

## Linked entities

- **Genes:** CST1 (cystatin SN) [NCBI Gene 1469], CEMIP (cell migration inducing hyaluronidase 1) [NCBI Gene 57214], TIMP1 (TIMP metallopeptidase inhibitor 1) [NCBI Gene 7076], MSLN (mesothelin) [NCBI Gene 10232], ATP4A (ATPase H+/K+ transporting subunit alpha) [NCBI Gene 495], KRT17 (keratin 17) [NCBI Gene 3872], CHIA (chitinase acidic) [NCBI Gene 27159], KRT16 (keratin 16) [NCBI Gene 3868], CRABP2 (cellular retinoic acid binding protein 2) [NCBI Gene 1382]
- **Diseases:** gastric cancer (MONDO:0001056)

## Full-text entities

- **Genes:** CRABP2 (cellular retinoic acid binding protein 2) [NCBI Gene 1382] {aka CRABP-II, RBP6}, TIMP1 (TIMP metallopeptidase inhibitor 1) [NCBI Gene 7076] {aka CLGI, EPA, EPO, HCI, TIMP, TIMP-1}, CST1 (cystatin SN) [NCBI Gene 1469], CEMIP (cell migration inducing hyaluronidase 1) [NCBI Gene 57214] {aka CCSP1, CEMIP1, HYBID, KIAA1199, TMEM2L}, AKT1 (AKT serine/threonine kinase 1) [NCBI Gene 207] {aka AKT, PKB, PKB-ALPHA, PRKBA, RAC, RAC-ALPHA}, KRT17 (keratin 17) [NCBI Gene 3872] {aka 39.1, CK-17, K17, PC2, PCHC1}, MSLN (mesothelin) [NCBI Gene 10232] {aka MPF, SMRP}, PIK3CB (phosphatidylinositol-4,5-bisphosphate 3-kinase catalytic subunit beta) [NCBI Gene 5291] {aka P110BETA, PI3K, PI3KBETA, PIK3C1}, ATP4A (ATPase H+/K+ transporting subunit alpha) [NCBI Gene 495] {aka ATP6A}, CHIA (chitinase acidic) [NCBI Gene 27159] {aka AMCASE, CHIT2, TSA1902}, KRT16 (keratin 16) [NCBI Gene 3868] {aka CK16, FNEPPK, K16, K1CP, KRT16A, NEPPK}
- **Diseases:** GC (MESH:D013274)

## Figures

11 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12296158/full.md

---
Source: https://tomesphere.com/paper/PMC12296158