SurgTPGS: Semantic 3D Surgical Scene Understanding with Text Promptable Gaussian Splatting
Yiming Huang, Long Bai, Beilei Cui, Kun Yuan, Guankun Wang, Mobarak I. Hoque, Nicolas Padoy, Nassir Navab, Hongliang Ren

TL;DR
SurgTPGS introduces a novel 3D surgical scene understanding method that integrates semantic features and text prompts, enabling real-time, precise reconstruction and interaction with surgical tools and anatomy.
Contribution
The paper proposes SurgTPGS, a pioneering Gaussian Splatting approach that incorporates semantic-aware features and regional optimization for enhanced 3D surgical scene understanding.
Findings
Outperforms state-of-the-art methods on real-world datasets.
Enables real-time, text-promptable 3D surgical queries.
Improves semantic reconstruction accuracy and smoothness.
Abstract
In contemporary surgical research and practice, accurately comprehending 3D surgical scenes with text-promptable capabilities is particularly crucial for surgical planning and real-time intra-operative guidance, where precisely identifying and interacting with surgical tools and anatomical structures is paramount. However, existing works focus on surgical vision-language model (VLM), 3D reconstruction, and segmentation separately, lacking support for real-time text-promptable 3D queries. In this paper, we present SurgTPGS, a novel text-promptable Gaussian Splatting method to fill this gap. We introduce a 3D semantics feature learning strategy incorporating the Segment Anything model and state-of-the-art vision-language models. We extract the segmented language features for 3D surgical scene reconstruction, enabling a more in-depth understanding of the complex surgical environment. We…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSurgical Simulation and Training · 3D Shape Modeling and Analysis · Robotics and Sensor-Based Localization
MethodsFocus
