# Uncovering Hidden Prognostic Patterns in Colorectal Cancer Histology Using Unsupervised Learning: A Computational Pathology Study

**Authors:** Wen-Tong Zhou, Yong Liu, Gang Yu, Kuan-Song Wang, Chao Xu, Jonathan Greenbaum, Chong Wu, Lin-Dong Jiang, Christopher J. Papasian, Hong-Mei Xiao, Hong-Wen Deng

PMC · DOI: 10.3390/bioengineering13030334 · Bioengineering · 2026-03-13

## TL;DR

This study uses unsupervised deep learning to find hidden patterns in colorectal cancer tissue images that predict patient outcomes better than current methods.

## Contribution

A novel computational pathology framework identifies histomorphological clusters with independent prognostic value in colorectal cancer.

## Key findings

- Three histomorphological clusters (Cluster13, Cluster19, Cluster24) were consistently associated with patient prognosis.
- A risk model combining clusters with clinical factors improved survival prediction in training and validation sets.
- Incorporating clusters with clinical data modestly but significantly enhanced predictive performance.

## Abstract

Colorectal cancer (CRC) remains a leading cause of cancer mortality globally, yet current histopathological diagnostics capture only limited features. This study aimed to discover subtle, prognostically significant histomorphological patterns in CRC tissues using unsupervised deep learning. We developed a framework integrating convolutional neural networks with deep clustering, trained on 23,341 image patches from 493 patients. We identified 30 distinct histomorphological clusters from CRC tissue images. Through univariate and multivariate survival analyses, three clusters (Cluster13, Cluster19, and Cluster24) were consistently associated with patient prognosis. These clusters were integrated with clinical factors (T stage, N stage, and differentiation degree) to construct a prognostic risk model. Patients stratified into high-risk and low-risk groups based on model predictions showed significant survival differences in both the training set (N = 493) and an independent validation set (N = 2590). Furthermore, logistic regression and multivariate Cox analyses demonstrated that incorporating the three histomorphological clusters alongside clinical factors yielded a modest but statistically significant improvement in predictive performance compared to clinical factors alone, indicating their complementary value for prognosis. This work demonstrates that computational pathology can uncover novel, visually elusive morphological features with independent prognostic value, offering potential to refine CRC patient stratification and inform clinical decision-making.

## Linked entities

- **Diseases:** colorectal cancer (MONDO:0005575), CRC (MONDO:0005575)

## Full-text entities

- **Diseases:** CRC (MESH:D015179), cancer (MESH:D009369)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC13024107/full.md

## Figures

6 figures with captions in the complete paper: https://tomesphere.com/paper/PMC13024107/full.md

## References

62 references — full list in the complete paper: https://tomesphere.com/paper/PMC13024107/full.md

---
Source: https://tomesphere.com/paper/PMC13024107