# Development And Validation of An RNA Binding Protein-Associated Prognostic Model for Colon Adenocarcinoma

**Authors:** Xiajing Yu, Daixin Guo, Jie Gao, Jialing Hu, Wenyige Zhang, Qijun Yang, Jingyi Wang, Yingcheng He, Kaili Liao, Xiaozhong Wang

PMC · DOI: 10.7150/jca.103477 · Journal of Cancer · 2025-05-18

## TL;DR

This study identifies four RNA-binding proteins that predict survival in colon cancer patients and builds a cost-effective prognostic model.

## Contribution

A novel RNA-binding protein-based prognostic model for colon adenocarcinoma with clinical applicability is developed and validated.

## Key findings

- 472 differentially expressed RNA-binding proteins were identified in colon cancer.
- Four key RBPs (MSI2, EZH2, NCL, TERT) were used to build a prognostic model.
- The model showed good performance with AUCs of 0.607 and 0.638 in training and validation sets.

## Abstract

Purpose: We aimed to identify prognostic RNA-binding proteins (RBP) in colon cancer, analyze their biological functions, and develop predictive models for patient prognosis.

Materials and Methods: We downloaded COAD's RNA sequencing data from the Cancer Genome Atlas (TCGA) database, and the expression and prognostic value of these RBPs in COAD were systematically evaluated. Differential expression, KEGG, and GO enrichment analyses were then performed. Cytoscape was used to visualize the protein-protein interaction network, and Cox regression was used to establish a predictive model. Finally, the expression of RBP was verified by the HPA database and immunohistochemical staining.

Results: A total of 472 differentially expressed RBPs were detected, including 321 up-regulated RBPs and 151 down-regulated RBPs. Four RBPs (MSI2, EZH2, NCL, TERT) were identified as key prognostic genes and used to construct prognostic models, based on this model, the overall survival (OS) of patients in high-risk subgroup was worse than that of patients in the low-risk subgroup. The area under the curve of time-dependent receiver operator characteristic curve of TCGA training set and Gene Expression Omnibus (GEO) validation set was 0.607 and 0.638 respectively, which confirmed that the prognosis model was good, it showed a good ability to identify COAD.

Conclusion: In general, our prognostic model is based on 4 RBPs encoding genes, which greatly reduces the cost of sequencing and is more conducive to clinical applications.

## Linked entities

- **Genes:** MSI2 (musashi RNA binding protein 2) [NCBI Gene 124540], EZH2 (enhancer of zeste 2 polycomb repressive complex 2 subunit) [NCBI Gene 2146], NUCLEOLIN (nucleolin multifunctional protein) [NCBI Gene 4691], TERT (telomerase reverse transcriptase) [NCBI Gene 7015]
- **Proteins:** MSI2 (musashi RNA binding protein 2), EZH2 (enhancer of zeste 2 polycomb repressive complex 2 subunit), NUCLEOLIN (nucleolin multifunctional protein), TERT (telomerase reverse transcriptase)
- **Diseases:** colon adenocarcinoma (MONDO:0002271), colon cancer (MONDO:0002032)

## Full-text entities

- **Genes:** MSI2 (musashi RNA binding protein 2) [NCBI Gene 124540] {aka MSI2H}, EZH2 (enhancer of zeste 2 polycomb repressive complex 2 subunit) [NCBI Gene 2146] {aka ENX-1, ENX1, EZH2b, KMT6, KMT6A, WVS}, NUCLEOLIN (nucleolin multifunctional protein) [NCBI Gene 4691] {aka C23, NCL, Nsr1}, TERT (telomerase reverse transcriptase) [NCBI Gene 7015] {aka CMM9, DKCA2, DKCB4, EST2, PFBMFT1, TCS1}
- **Diseases:** colon cancer (MESH:D015179), COAD (MESH:D029424), Cancer (MESH:D009369), Colon Adenocarcinoma (MESH:D003110)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12170993/full.md

## Figures

10 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12170993/full.md

## References

30 references — full list in the complete paper: https://tomesphere.com/paper/PMC12170993/full.md

---
Source: https://tomesphere.com/paper/PMC12170993