Benchmarking Pretrained Attention-based Models for Real-Time Recognition in Robot-Assisted Esophagectomy
Ronald L.P.D. de Jong, Yasmina al Khalil, Tim J.M. Jaspers, Romy C., van Jaarsveld, Gino M. Kuiper, Yiping Li, Richard van Hillegersberg, Jelle P., Ruurda, Marcel Breeuwer, Fons van der Sommen

TL;DR
This paper benchmarks eight real-time deep learning models, including attention-based networks, on a novel RAMIE dataset for surgical anatomy segmentation, highlighting the superiority of attention mechanisms and pretraining on ADE20k.
Contribution
It introduces the largest dataset for RAMIE anatomy segmentation and evaluates the performance of various models, emphasizing the benefits of attention-based architectures and specific pretraining datasets.
Findings
Attention-based models outperform traditional CNNs.
Pretraining on ADE20k yields better results than ImageNet.
SegNeXt and Mask2Former achieve top Dice scores.
Abstract
Esophageal cancer is among the most common types of cancer worldwide. It is traditionally treated using open esophagectomy, but in recent years, robot-assisted minimally invasive esophagectomy (RAMIE) has emerged as a promising alternative. However, robot-assisted surgery can be challenging for novice surgeons, as they often suffer from a loss of spatial orientation. Computer-aided anatomy recognition holds promise for improving surgical navigation, but research in this area remains limited. In this study, we developed a comprehensive dataset for semantic segmentation in RAMIE, featuring the largest collection of vital anatomical structures and surgical instruments to date. Handling this diverse set of classes presents challenges, including class imbalance and the recognition of complex structures such as nerves. This study aims to understand the challenges and limitations of current…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsLung Cancer Diagnosis and Treatment · Radiomics and Machine Learning in Medical Imaging · Medical Imaging Techniques and Applications
MethodsSparse Evolutionary Training
