Learning with Geometric Priors in U-Net Variants for Polyp Segmentation
Fabian Vazquez, Jose A. Nu\~nez, Diego Adame, Alissen Moreno, Augustin Zhan, Huimin Li, Jinghao Yang, Haoteng Tang, Bin Fu, Pengfei Gu

TL;DR
This paper introduces a Geometric Prior-guided Module (GPM) that enhances U-Net-based polyp segmentation models by incorporating explicit geometric and depth information, leading to improved accuracy in challenging colonoscopy images.
Contribution
The paper proposes a novel GPM that injects geometric priors into U-Net variants using depth estimation from a fine-tuned VGGT, improving segmentation performance.
Findings
Consistent performance improvements over baseline models on five datasets.
Effective integration of depth-based geometric priors into U-Net architectures.
GPM is versatile and can be added to various U-Net variants.
Abstract
Accurate and robust polyp segmentation is essential for early colorectal cancer detection and for computer-aided diagnosis. While convolutional neural network-, Transformer-, and Mamba-based U-Net variants have achieved strong performance, they still struggle to capture geometric and structural cues, especially in low-contrast or cluttered colonoscopy scenes. To address this challenge, we propose a novel Geometric Prior-guided Module (GPM) that injects explicit geometric priors into U-Net-based architectures for polyp segmentation. Specifically, we fine-tune the Visual Geometry Grounded Transformer (VGGT) on a simulated ColonDepth dataset to estimate depth maps of polyp images tailored to the endoscopic domain. These depth maps are then processed by GPM to encode geometric priors into the encoder's feature maps, where they are further refined using spatial and channel attention…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsColorectal Cancer Screening and Detection · Advanced Neural Network Applications · Advanced Image and Video Retrieval Techniques
