Endo-G$^{2}$T: Geometry-Guided & Temporally Aware Time-Embedded 4DGS For Endoscopic Scenes
Yangle Liu, Fengze Li, Kan Liu, Jieming Ma

TL;DR
Endo-G$^{2}$T introduces a geometry-guided, temporally aware training scheme for 4D Gaussian splatting that improves endoscopic scene reconstruction by anchoring geometry early and maintaining temporal consistency.
Contribution
The paper proposes a novel training scheme combining geometry-guided prior distillation, a time-embedded Gaussian field, and keyframe-constrained streaming for improved 4D scene reconstruction.
Findings
Achieves state-of-the-art results on EndoNeRF and StereoMIS-P1 datasets.
Enhances temporal coherence and geometric accuracy in endoscopic scene modeling.
Improves efficiency and stability in long-horizon dynamic scene reconstruction.
Abstract
Endoscopic (endo) video exhibits strong view-dependent effects such as specularities, wet reflections, and occlusions. Pure photometric supervision misaligns with geometry and triggers early geometric drift, where erroneous shapes are reinforced during densification and become hard to correct. We ask how to anchor geometry early for 4D Gaussian splatting (4DGS) while maintaining temporal consistency and efficiency in dynamic endoscopic scenes. Thus, we present Endo-GT, a geometry-guided and temporally aware training scheme for time-embedded 4DGS. First, geo-guided prior distillation converts confidence-gated monocular depth into supervision with scale-invariant depth and depth-gradient losses, using a warm-up-to-cap schedule to inject priors softly and avoid early overfitting. Second, a time-embedded Gaussian field represents dynamics in XYZT with a rotor-like rotation…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · Computer Graphics and Visualization Techniques · Image Enhancement Techniques
