SurgicalGaussian: Deformable 3D Gaussians for High-Fidelity Surgical Scene Reconstruction
Weixing Xie, Junfeng Yao, Xianpeng Cao, Qiqin Lin, Zerui Tang, Xiao, Dong, Xiaohu Guo

TL;DR
SurgicalGaussian introduces a deformable 3D Gaussian Splatting method for high-fidelity, real-time reconstruction of dynamic surgical scenes, effectively handling occlusions and intricate tissue details in endoscopic videos.
Contribution
It presents a novel deformable 3D Gaussian approach that models soft tissue dynamics and improves reconstruction quality and speed over existing neural radiance field-based methods.
Findings
Outperforms existing methods in rendering quality
Achieves faster rendering speeds
Uses less GPU memory
Abstract
Dynamic reconstruction of deformable tissues in endoscopic video is a key technology for robot-assisted surgery. Recent reconstruction methods based on neural radiance fields (NeRFs) have achieved remarkable results in the reconstruction of surgical scenes. However, based on implicit representation, NeRFs struggle to capture the intricate details of objects in the scene and cannot achieve real-time rendering. In addition, restricted single view perception and occluded instruments also propose special challenges in surgical scene reconstruction. To address these issues, we develop SurgicalGaussian, a deformable 3D Gaussian Splatting method to model dynamic surgical scenes. Our approach models the spatio-temporal features of soft tissues at each time stamp via a forward-mapping deformation MLP and regularization to constrain local 3D Gaussians to comply with consistent movement. With the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAnatomy and Medical Technology · 3D Shape Modeling and Analysis · Surgical Simulation and Training
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
