Graph Neural Networks for Surgical Scene Segmentation
Yihan Li, Nikhil Churamani, Maria Robu, Imanol Luengo, Danail Stoyanov

TL;DR
This paper introduces graph neural network-based segmentation models that improve the accuracy and anatomical coherence of surgical scene analysis, especially for rare and critical structures, by combining Vision Transformers with graph reasoning.
Contribution
It presents novel integration of ViT encoders with GNNs for surgical segmentation, enhancing spatial relationships and long-range dependencies over prior methods.
Findings
Achieved 7-8% higher mIoU than baselines.
Produced more anatomically coherent predictions.
Improved segmentation of rare and safety-critical structures.
Abstract
Purpose: Accurate identification of hepatocystic anatomy is critical to preventing surgical complications during laparoscopic cholecystectomy. Deep learning models often struggle with occlusions, long-range dependencies, and capturing the fine-scale geometry of rare structures. This work addresses these challenges by introducing graph-based segmentation approaches that enhance spatial and semantic understanding in surgical scene analyses. Methods: We propose two segmentation models integrating Vision Transformer (ViT) feature encoders with Graph Neural Networks (GNNs) to explicitly model spatial relationships between anatomical regions. (1) A static k Nearest Neighbours (k-NN) graph with a Graph Convolutional Network with Initial Residual and Identity Mapping (GCNII) enables stable long-range information propagation. (2) A dynamic Differentiable Graph Generator (DGG) with a Graph…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSurgical Simulation and Training · Advanced Neural Network Applications · Medical Imaging and Analysis
