TransNetR: Transformer-based Residual Network for Polyp Segmentation with Multi-Center Out-of-Distribution Testing
Debesh Jha, Nikhil Kumar Tomar, Vanshali Sharma, Ulas Bagci

TL;DR
TransNetR is a real-time transformer-based residual network that significantly improves polyp segmentation accuracy and generalizability across multiple datasets, including out-of-distribution data, for colonoscopy screening.
Contribution
The paper introduces TransNetR, a novel transformer-based residual network architecture that achieves high accuracy and real-time processing speed for polyp segmentation, with strong generalization across diverse datasets.
Findings
Achieved a dice coefficient of 0.8706 and IoU of 0.8016 on Kvasir-SEG.
Maintained real-time speed of 54.60 fps.
Outperformed existing methods on multiple out-of-distribution datasets.
Abstract
Colonoscopy is considered the most effective screening test to detect colorectal cancer (CRC) and its precursor lesions, i.e., polyps. However, the procedure experiences high miss rates due to polyp heterogeneity and inter-observer dependency. Hence, several deep learning powered systems have been proposed considering the criticality of polyp detection and segmentation in clinical practices. Despite achieving improved outcomes, the existing automated approaches are inefficient in attaining real-time processing speed. Moreover, they suffer from a significant performance drop when evaluated on inter-patient data, especially those collected from different centers. Therefore, we intend to develop a novel real-time deep learning based architecture, Transformer based Residual network (TransNetR), for colon polyp segmentation and evaluate its diagnostic performance. The proposed architecture,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsColorectal Cancer Screening and Detection · Radiomics and Machine Learning in Medical Imaging · COVID-19 diagnosis using AI
MethodsMulti-Head Attention · Attention Is All You Need · Test · Byte Pair Encoding · Position-Wise Feed-Forward Layer · Residual Connection · Dense Connections · Absolute Position Encodings · Linear Layer · Label Smoothing
