# Compact deep learning models for colon histopathology focusing performance and generalization challenges

**Authors:** Fareeha Hanif, Ali Raza, Heba Abdelgader Mohammed

PMC · DOI: 10.1038/s41598-026-35119-y · Scientific Reports · 2026-01-19

## TL;DR

This paper introduces compact deep learning models for colon cancer histopathology classification, achieving high accuracy but facing generalization challenges on unseen data.

## Contribution

The paper introduces and evaluates four lightweight CNN models for colon histopathology classification, highlighting their performance and generalization limitations.

## Key findings

- Lite-V2 achieved near-perfect validation performance with macro-F1 ≈ 0.999 and compact size (1.53 MB).
- Lite-V2 showed significant generalization drop on the test set with macro-F1 = 0.33, indicating domain shift.
- The study emphasizes the need for domain adaptation and stain-robust training for reliable deployment.

## Abstract

Colorectal cancer is a leading cause of cancer-related mortality, and accurate analysis of histopathological images is critical for early diagnosis and improved patient outcomes. This study proposes and systematically evaluates four purpose-built lightweight convolutional neural network (CNN) variants (Lite-V0, Lite-V1, Lite-V2, and Lite-V4) for binary classification of colon histopathology images into Colon_Adenocarcinoma and Colon_Benign_Tissue. Experiments were conducted on a balanced dataset (24,000 images) with fixed train/validation/test splits and comprehensive evaluation using accuracy and macro-F1, supported by confusion matrices and ROC/precision–recall analyses. Among all variants, Lite-V2 achieved the best validation performance (macro-F1 \documentclass[12pt]{minimal}
				\usepackage{amsmath}
				\usepackage{wasysym} 
				\usepackage{amsfonts} 
				\usepackage{amssymb} 
				\usepackage{amsbsy}
				\usepackage{mathrsfs}
				\usepackage{upgreek}
				\setlength{\oddsidemargin}{-69pt}
				\begin{document}$$\approx$$\end{document} 0.999), while remaining highly compact (1.53 MB; 127,682 parameters), indicating a favorable accuracy–efficiency trade-off for deployment-oriented diagnostic support. On the independent test set, however, Lite-V2 exhibited a marked generalization drop, achieving approximately 50% accuracy and macro-F1 = 0.33, suggesting a domain-shift effect between validation and test samples. These findings demonstrate that lightweight CNNs can achieve near-perfect internal validation performance for colon histopathology classification, but robust cross-domain generalization remains essential; future work will focus on domain adaptation and stain-robust training strategies to improve reliability on unseen clinical data.

## Linked entities

- **Diseases:** Colorectal cancer (MONDO:0005575), Colon_Adenocarcinoma (MONDO:0002271)

## Full-text entities

- **Diseases:** prostate cancer (MESH:D011471), dysplasia (MESH:D015792), colon and lung cancer (MESH:D008175), Adenocarcinoma (MESH:D000230), cancer (MESH:D009369), skin cancer (MESH:D012878), neurological disorders (MESH:D009461), Colon adenocarcinoma (MESH:D003110), brain disease (MESH:D001927), CRC (MESH:D015179), cardiovascular disease (MESH:D002318), diabetic retinopathy (MESH:D003930), breast cancer (MESH:D001943), gastrointestinal cancer (MESH:D005770), ocular disease (MESH:D005128), Colon (MESH:D003108), brain tumor (MESH:D001932)
- **Chemicals:** Lite (-), H&amp;E (MESH:D006371)
- **Species:** Vibrio sp. 2 (species) [taxon 387413], Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12886981/full.md

## Figures

17 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12886981/full.md

## References

53 references — full list in the complete paper: https://tomesphere.com/paper/PMC12886981/full.md

---
Source: https://tomesphere.com/paper/PMC12886981