# Deep learning for detecting early gastric cancer with white-light endoscopy: a systematic review and meta-analysis

**Authors:** Jixiang Liu, Danyan Li, Yudi Zhuo, Shengsheng Zhang

PMC · DOI: 10.3389/frai.2026.1734591 · Frontiers in Artificial Intelligence · 2026-01-29

## TL;DR

This paper reviews deep learning models for detecting early gastric cancer using endoscopy images and finds their performance comparable to expert doctors.

## Contribution

A systematic review and meta-analysis comparing deep learning models to expert endoscopists for early gastric cancer detection.

## Key findings

- DL models showed high sensitivity and specificity in internal validation (0.91 and 0.93).
- Diagnostic accuracy of DL models was comparable to expert endoscopists.
- Training dataset size significantly influenced model performance heterogeneity.

## Abstract

The aim of this study is to evaluate the performance of DL algorithms in diagnosing early gastric cancer (EGC) using white light endoscopic images.

A systematic literature search was conducted in PubMed, Embase, Cochrane Library, and Web of Science up to July 25, 2025. Sensitivity and specificity were pooled for internal and external validation sets. The comparison between DL algorithms and expert endoscopists was performed using paired forest plots. Meta-regression was used to identify sources of heterogeneity.

In the internal validation, 15 studies comprising 37,037 images (range: 433–9,650) were included. Pooled sensitivity and specificity were 0.91 (95% CI: 0.82–0.95) and 0.93 (95% CI: 0.87–0.97), respectively. Meta-regression showed that heterogeneity in sensitivity and specificity was significantly associated with training dataset size. For external validation, 4 studies with 3,579 images (range: 200–1,514) were included, yielding pooled sensitivity and specificity of 0.82 (95% CI: 0.61–0.93) and 0.83 (95% CI: 0.74–0.90), respectively. No significant difference was observed between deep learning models and expert endoscopists in diagnostic sensitivity and specificity.

Deep learning algorithms exhibit high diagnostic performance in detecting early gastric cancer using white-light endoscopy. The diagnostic accuracy of DL models is comparable to that of expert endoscopists, supporting their potential role as a clinical decision-support tool.

https://www.crd.york.ac.uk/PROSPERO/view/CRD420251112418, identifier CRD420251112418.

## Linked entities

- **Diseases:** gastric cancer (MONDO:0001056)

## Full-text entities

- **Diseases:** DL (MESH:C537113), EGC (MESH:D013274)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12894240/full.md

## Figures

8 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12894240/full.md

## References

62 references — full list in the complete paper: https://tomesphere.com/paper/PMC12894240/full.md

---
Source: https://tomesphere.com/paper/PMC12894240