Swin transformers are robust to distribution and concept drift in endoscopy-based longitudinal rectal cancer assessment
Jorge Tapias Gomez, Aneesh Rangnekar, Hannah Williams, Hannah, Thompson, Julio Garcia-Aguilar, Joshua Jesse Smith, Harini Veeraraghavan

TL;DR
This study demonstrates that Swin transformers are highly robust to distribution and concept drift in endoscopy images, enabling consistent longitudinal rectal cancer assessment across varied imaging conditions.
Contribution
The paper introduces a hierarchical Swin transformer model that outperforms other deep learning models in detecting rectal cancer and regrowth in endoscopy images, even under distribution shifts.
Findings
Swin transformer achieved high accuracy in in-distribution and out-of-distribution datasets.
Swin outperformed ResNet and ViT models in robustness to color shifts.
Deep learning models can provide objective, consistent assessment in endoscopic cancer monitoring.
Abstract
Endoscopic images are used at various stages of rectal cancer treatment starting from cancer screening, diagnosis, during treatment to assess response and toxicity from treatments such as colitis, and at follow up to detect new tumor or local regrowth (LR). However, subjective assessment is highly variable and can underestimate the degree of response in some patients, subjecting them to unnecessary surgery, or overestimate response that places patients at risk of disease spread. Advances in deep learning has shown the ability to produce consistent and objective response assessment for endoscopic images. However, methods for detecting cancers, regrowth, and monitoring response during the entire course of patient treatment and follow-up are lacking. This is because, automated diagnosis and rectal cancer response assessment requires methods that are robust to inherent imaging illumination…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsColorectal Cancer Screening and Detection · Radiomics and Machine Learning in Medical Imaging
MethodsAttention Is All You Need · Average Pooling · Softmax · Linear Layer · Layer Normalization · Dense Connections · Residual Connection · Multi-Head Attention · Global Average Pooling · Kaiming Initialization
