TL;DR
DepthPolyp is a lightweight, pseudo-depth-guided segmentation framework that achieves real-time, accurate polyp segmentation in colonoscopy, robust to real-world challenges and suitable for resource-limited clinical settings.
Contribution
It introduces a novel multi-task learning architecture with efficient feature modulation techniques, enabling superior performance and generalization in real-time colonoscopy scenarios.
Findings
Outperforms larger models in polyp segmentation accuracy.
Runs at over 180 FPS on mobile devices, enabling real-time clinical use.
Demonstrates strong cross-dataset generalization, especially on degraded data.
Abstract
Accurate polyp segmentation in colonoscopy is essential for early colorectal cancer detection, yet real-world clinical environments pose persistent challenges such as motion blur, specular reflections, and illumination instability. Most existing methods are optimized on clean benchmark images and suffer noticeable performance degradation when deployed in authentic surgical scenarios. We propose DepthPolyp, a lightweight and robust segmentation framework based on pseudo-depth-guided multi-task learning and efficient feature modulation. The architecture combines hierarchical Ghost factorization for compact feature generation, Interleaved Shuffle Fusion for low-cost cross-scale interaction, and Dynamic Group Gating for adaptive group-wise feature weighting. Extensive experiments demonstrate that DepthPolyp achieves strong cross-dataset generalization when trained on degraded data and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
