# Endo-DET: A Domain-Specific Detection Framework for Multi-Class Endoscopic Disease Detection

**Authors:** Yijie Lu, Yixiang Zhao, Qiang Yu, Wei Shao, Renbin Shen

PMC · DOI: 10.3390/jimaging12030112 · Journal of Imaging · 2026-03-06

## TL;DR

Endo-DET is a new framework for detecting multiple types of gastrointestinal diseases in endoscopic images, improving accuracy and real-time performance.

## Contribution

Endo-DET introduces a domain-specific detection framework with novel modules to address illumination inconsistency and scale variability in endoscopic imaging.

## Key findings

- Endo-DET improves mAP50-95 by 5.8 to 10.8 percentage points across multiple datasets compared to the DEIM baseline.
- The framework achieves real-time performance at 330 FPS with TensorRT FP16 optimization.
- Endo-DET shows consistent cross-dataset improvements in detection accuracy and robustness.

## Abstract

Gastrointestinal cancers account for roughly a quarter of global cancer incidence, and early detection through endoscopy has proven effective in reducing mortality. Multi-class endoscopic disease detection, however, faces three persistent challenges: feature redundancy from non-pathological content, severe illumination inconsistency across imaging modalities, and extreme scale variability with blurry boundaries. This paper introduces Endo-DET, a domain-specific detection framework addressing these challenges through three synergistic components. The Adaptive Lesion-Discriminative Filtering (ALDF) module achieves lesion-focused attention via sparse simplex projection, reducing complexity from O(N2) to O(αN2). The Global–Local Illumination Modulation Neck (GLIM-Neck) enables illumination-aware multi-scale fusion through four cooperative mechanisms, maintaining stable performance across white-light endoscopy, narrow-band imaging, and chromoendoscopy. The Lesion-aware Unified Calibration and Illumination-robust Discrimination (LUCID) module uses dual-stream reciprocal modulation to integrate boundary-sensitive textures with global semantics while suppressing instrument artifacts. Experiments on EDD2020, Kvasir-SEG, PolypGen2021, and CVC-ClinicDB show that Endo-DET improves mAP50-95 over the DEIM baseline by 5.8, 10.8, 4.1, and 10.1 percentage points respectively, with mAP75 gains of 6.1, 10.3, 6.8, and 9.3 points, and Recall50-95 improvements of 10.9, 12.1, 11.1, and 11.5 points. Running at 330 FPS with TensorRT FP16 optimization, Endo-DET achieves consistent cross-dataset improvements while maintaining real-time capability, providing a methodological foundation for clinical computer-aided diagnosis.

## Full-text entities

- **Diseases:** gastrointestinal disease (MESH:D005767), colorectal cancer (MESH:D015179), colorectal polyp (MESH:D003111), gastric cancer (MESH:D013274), dysplasia (MESH:D015792), GCT (MESH:C537296), polyp (MESH:D011127), Endoscopic disease (MESH:D004194), Gastrointestinal cancers (MESH:D005770), lesion (MESH:D009059), injury to (MESH:D014947), BE (MESH:D001471), HGD (MESH:D008228), fatigue (MESH:D005221), LUCID (MESH:D058926), Cancer (MESH:D009369)
- **Chemicals:** DET (MESH:D003671), ALDF (-)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC13028429/full.md

## Figures

8 figures with captions in the complete paper: https://tomesphere.com/paper/PMC13028429/full.md

## References

34 references — full list in the complete paper: https://tomesphere.com/paper/PMC13028429/full.md

---
Source: https://tomesphere.com/paper/PMC13028429