# Identification and validation of butyrate metabolism-related biomarkers for colorectal cancer diagnosis

**Authors:** Miao Yu, Qian Chen, Tianhe Gu, Yiping Lu

PMC · DOI: 10.7717/peerj.20942 · PeerJ · 2026-03-23

## TL;DR

This study identifies six biomarkers related to butyrate metabolism that could help diagnose colorectal cancer and improve understanding of its metabolic changes.

## Contribution

The study introduces six validated butyrate metabolism-related biomarkers for colorectal cancer diagnosis using multi-omics and experimental validation.

## Key findings

- Sixty-three differentially expressed butyrate metabolism-related genes were identified and enriched in CRC-related pathways.
- Six hub biomarkers (CCND1, CXCL8, MMP3, MYC, TIMP1, and VEGFA) were validated using PPI, LASSO, and ROC analyses.
- qRT-PCR confirmed upregulation of CCND1, CXCL8, MYC, and VEGFA in CRC cell lines.

## Abstract

Unlike normal colon cells with butyrate acid as the main energy source, cancerous colon cells are more inclined to use glucose. However, the mechanisms of the investigation into the modulatory role of butyrate metabolism within the pathophysiology of colorectal cancer (CRC) remains insufficiently explored.

The study analyzed four datasets (The Cancer Genome Atlas (TCGA)-COAD, TCGA-READ, GSE41258, and GSE39582) and gene sets related to butyrate metabolism-related genes (BMGs) in an integrated manner. Differentially expressed BMGs (DE-BMGs) were screened by overlapping BMGs, TCGA-DEGs between CRC and normal groups, and Gene Expression Omnibus (GEO)-differentially expressed genes (DEGs) between CRC and normal groups and were subjected to enrichment analysis. Hub genes were then screened via protein–protein interaction (PPI) network analysis. Biomarker selection was improved by applying the least absolute shrinkage and selection operator (LASSO) and receiver operating characteristic (ROC) curve analyses. Subgroup survival analyses were stratified according to different clinical phenotypes. A regulatory network modeled on competitive endogenous RNA was subsequently constructed. Finally, based on normal colon epithelial cells (NCM-460) and colon cancer cells (LOVO, HCT116, LS174T, and LS513), we detected the differential expression of biomarkers between the two groups using quantitative real-time polymerase chain reaction (qRT-PCR) methods.

Sixty-three DE-BMGs were obtained. Enrichment analysis showed significant correlations between DE-BMGs and signaling receptor activator activity and peroxisome proliferator-activated receptor-dominated pathways. Subsequently, six total biomarkers (CCND1, CXCL8, MMP3, MYC, TIMP1, and VEGFA) were obtained via PPI, LASSO, and ROC curve validation analyses. Survival analysis revealed significant differences in survival metrics between different clinical cohorts. Ingenuity pathway analysis demonstrated that pathways associated with identified biomarkers were disrupted, especially those associated with the tumor microenvironment. Finally, a computational prediction model was developed for 156 pharmacological agents targeting five key biomarkers: CCND1, CXCL8, MMP3, MYC, and VEGFA. The results of the qRT-PCR study indicated that CCND1, CXCL8, MYC, and VEGFA were upregulated in CRC cell lines, an observation consistent with existing public database records.

Six butyrate metabolism-related biomarkers (CCND1, CXCL8, MMP3, MYC, TIMP1, and VEGFA) were screened out to provide a basis for exploring the prediction of CRC diagnosis.

## Linked entities

- **Genes:** CCND1 (cyclin D1) [NCBI Gene 595], CXCL8 (C-X-C motif chemokine ligand 8) [NCBI Gene 3576], MMP3 (matrix metallopeptidase 3) [NCBI Gene 4314], MYC (MYC proto-oncogene, bHLH transcription factor) [NCBI Gene 4609], TIMP1 (TIMP metallopeptidase inhibitor 1) [NCBI Gene 7076], VEGFA (vascular endothelial growth factor A) [NCBI Gene 7422]
- **Diseases:** colorectal cancer (MONDO:0005575)

## Full-text entities

- **Genes:** VEGFA (vascular endothelial growth factor A) [NCBI Gene 7422] {aka L-VEGF, MVCD1, VEGF, VPF}, CCND1 (cyclin D1) [NCBI Gene 595] {aka BCL1, D11S287E, PRAD1, U21B31}, MMP3 (matrix metallopeptidase 3) [NCBI Gene 4314] {aka CHDS6, MMP-3, SL-1, STMY, STMY1, STR1}, TIMP1 (TIMP metallopeptidase inhibitor 1) [NCBI Gene 7076] {aka CLGI, EPA, EPO, HCI, TIMP, TIMP-1}, CXCL8 (C-X-C motif chemokine ligand 8) [NCBI Gene 3576] {aka GCP-1, GCP1, IL8, LECT, LUCT, LYNAP}, MYC (MYC proto-oncogene, bHLH transcription factor) [NCBI Gene 4609] {aka MRTL, MYCC, bHLHe39, c-Myc}
- **Diseases:** Cancer (MESH:D009369), CRC (MESH:D015179)
- **Chemicals:** glucose (MESH:D005947), butyrate (MESH:D002087), butyrate acid (-)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC13020437/full.md

## Figures

9 figures with captions in the complete paper: https://tomesphere.com/paper/PMC13020437/full.md

## References

46 references — full list in the complete paper: https://tomesphere.com/paper/PMC13020437/full.md

---
Source: https://tomesphere.com/paper/PMC13020437