ProtoFlow: Interpretable and Robust Surgical Workflow Modeling with Learned Dynamic Scene Graph Prototypes
Felix Holm, Ghazal Ghazaei, Nassir Navab

TL;DR
ProtoFlow is a novel framework that learns interpretable dynamic scene graph prototypes to model complex surgical workflows, improving accuracy, robustness, and explainability in AI-assisted surgery with limited data.
Contribution
It introduces a prototype-based GNN approach that enhances interpretability and robustness in surgical workflow modeling, especially under data scarcity.
Findings
Outperforms standard GNN baselines in accuracy
Maintains performance with as few as one training video
Provides interpretable insights into surgical sub-techniques
Abstract
Purpose: Detailed surgical recognition is critical for advancing AI-assisted surgery, yet progress is hampered by high annotation costs, data scarcity, and a lack of interpretable models. While scene graphs offer a structured abstraction of surgical events, their full potential remains untapped. In this work, we introduce ProtoFlow, a novel framework that learns dynamic scene graph prototypes to model complex surgical workflows in an interpretable and robust manner. Methods: ProtoFlow leverages a graph neural network (GNN) encoder-decoder architecture that combines self-supervised pretraining for rich representation learning with a prototype-based fine-tuning stage. This process discovers and refines core prototypes that encapsulate recurring, clinically meaningful patterns of surgical interaction, forming an explainable foundation for workflow analysis. Results: We evaluate our…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSurgical Simulation and Training · Artificial Intelligence in Healthcare and Education · Machine Learning in Healthcare
