Hybrid(Transformer+CNN)-based Polyp Segmentation
Madan Baduwal

TL;DR
This paper introduces a hybrid Transformer and CNN-based model for polyp segmentation in colonoscopy images, significantly improving accuracy and robustness against artifacts and boundary ambiguities compared to existing methods.
Contribution
A novel hybrid architecture combining Transformer and CNN components that enhances polyp segmentation accuracy and robustness in challenging endoscopic conditions.
Findings
Improved segmentation recall to 95.55%.
Achieved 0.07% higher accuracy than state-of-the-art.
Demonstrated robustness against common endoscopic artifacts.
Abstract
Colonoscopy is still the main method of detection and segmentation of colonic polyps, and recent advancements in deep learning networks such as U-Net, ResUNet, Swin-UNet, and PraNet have made outstanding performance in polyp segmentation. Yet, the problem is extremely challenging due to high variation in size, shape, endoscopy types, lighting, imaging protocols, and ill-defined boundaries (fluid, folds) of the polyps, rendering accurate segmentation a challenging and problematic task. To address these critical challenges in polyp segmentation, we introduce a hybrid (Transformer + CNN) model that is crafted to enhance robustness against evolving polyp characteristics. Our hybrid architecture demonstrates superior performance over existing solutions, particularly in addressing two critical challenges: (1) accurate segmentation of polyps with ill-defined margins through boundary-aware…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsColorectal Cancer Screening and Detection · Advanced Neural Network Applications · COVID-19 diagnosis using AI
