Towards Robust Algorithms for Surgical Phase Recognition via Digital Twin Representation
Hao Ding, Yuqian Zhang, Wenzheng Cheng, Xinyu Wang, Xu Lian, Chenhao, Yu, Hongchao Shu, Ji Woong Kim, Axel Krieger, Mathias Unberath

TL;DR
This paper introduces a digital twin-based framework for surgical phase recognition that significantly improves robustness against out-of-distribution and corrupted data, outperforming traditional models.
Contribution
The study presents a novel digital twin representation approach for surgical phase recognition, enhancing robustness and reliability of models in varied surgical video conditions.
Findings
Achieved 80.3% accuracy on corrupted Cholec80 test set
Outperformed baseline by 3.9% on OOD samples
Using DT as augmentation improves robustness
Abstract
Surgical phase recognition (SPR) is an integral component of surgical data science, enabling high-level surgical analysis. End-to-end trained neural networks that predict surgical phase directly from videos have shown excellent performance on benchmarks. However, these models struggle with robustness due to non-causal associations in the training set. Our goal is to improve model robustness to variations in the surgical videos by leveraging the digital twin (DT) paradigm -- an intermediary layer to separate high-level analysis (SPR) from low-level processing. As a proof of concept, we present a DT representation-based framework for SPR from videos. The framework employs vision foundation models with reliable low-level scene understanding to craft DT representation. We embed the DT representation in place of raw video inputs in the state-of-the-art SPR model. The framework is trained on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced X-ray Imaging Techniques · Advanced X-ray and CT Imaging · Medical Image Segmentation Techniques
