# Transfer learning with Bayesian optimization for colorectal cancer histopathology classification

**Authors:** Houda Saif ALGhafri, Chia S. Lim

PMC · DOI: 10.1186/s12880-026-02149-x · BMC Medical Imaging · 2026-01-12

## TL;DR

This paper introduces CRC-BayTune, a Bayesian optimization method to improve deep learning models for colorectal cancer histopathology classification by tuning hyperparameters.

## Contribution

The novel contribution is applying Bayesian optimization to systematically tune hyperparameters in transfer learning models for CRC classification.

## Key findings

- DenseNet201, InceptionV3, InceptionResNetV2, and ResNet50V2 achieved high median MCC values (0.984-0.983).
- Model performance is significantly affected by both architecture and hyperparameter configuration.
- Deeper feature hierarchies showed more stable convergence and less accuracy degradation under noise.

## Abstract

Automated colorectal cancer (CRC) histopathology classification remains challenging due to variations in datasets, staining conditions, and tissue morphology across institutions. Many prior studies apply standard CNN architectures with fixed hyperparameters, leaving limited examination of how model choice and optimization strategies affect performance robustness across heterogeneous CRC data.

We evaluate eight transfer learning models on three-class CRC datasets and propose CRC-BayTune, applying Bayesian optimization to tune key training parameters, including learning rate, batch size, with fine-tuning depth. All models are assessed in patch-level experimental settings, and statistical significance is quantified using Friedman tests, repeated-measures ANOVA, and post hoc analyses. Robustness is assessed by introducing controlled Gaussian noise perturbations. Grad-CAM provides qualitative visual explanations by highlighting regions that contribute to model predictions.

DenseNet201, InceptionV3, InceptionResNetV2, and ResNet50V2 achieved the highest median MCC values of 0.984, 0.982, 0.975, and 0.983, respectively. Statistical analysis confirms that both model architecture (\documentclass[12pt]{minimal}
				\usepackage{amsmath}
				\usepackage{wasysym} 
				\usepackage{amsfonts} 
				\usepackage{amssymb} 
				\usepackage{amsbsy}
				\usepackage{mathrsfs}
				\usepackage{upgreek}
				\setlength{\oddsidemargin}{-69pt}
				\begin{document}$$\textrm{p}=0.059$$\end{document}, Friedman) and hyperparameter configuration (\documentclass[12pt]{minimal}
				\usepackage{amsmath}
				\usepackage{wasysym} 
				\usepackage{amsfonts} 
				\usepackage{amssymb} 
				\usepackage{amsbsy}
				\usepackage{mathrsfs}
				\usepackage{upgreek}
				\setlength{\oddsidemargin}{-69pt}
				\begin{document}$$\textrm{p}=0.027$$\end{document}, RM-ANOVA) significantly affect performance. Models with deeper feature hierarchies demonstrated more stable convergence and smaller accuracy degradation under noise.

The results show that systematic hyperparameter tuning can improve the training stability and classification performance of standard CNN models compared with fixed configurations in CRC histopathology tasks. The findings underscore that model performance in this setting is sensitive to choices such as learning rate, batch size, and fine-tuning depth, and that evaluating these factors explicitly can support more reliable use of deep learning models in computational pathology.

## Linked entities

- **Diseases:** colorectal cancer (MONDO:0005575)

## Full-text entities

- **Diseases:** colorectal cancer (MESH:D015179)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12888672/full.md

## Figures

4 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12888672/full.md

## References

2 references — full list in the complete paper: https://tomesphere.com/paper/PMC12888672/full.md

---
Source: https://tomesphere.com/paper/PMC12888672