Learning Dynamic Scene Reconstruction with Sinusoidal Geometric Priors
Tian Guo, Hui Yuan, Philip Xu, David Elizondo

TL;DR
SirenPose introduces a sinusoidal representation-based loss with geometric priors and physics-inspired constraints, significantly enhancing dynamic 3D scene reconstruction accuracy and spatiotemporal consistency in fast-moving, multi-target scenes.
Contribution
The paper presents SirenPose, a novel loss function combining sinusoidal activations and geometric priors, along with an expanded dataset, to improve dynamic scene reconstruction.
Findings
Improved spatiotemporal consistency metrics.
Enhanced accuracy in rapid motion scenes.
Superior performance over prior methods.
Abstract
We propose SirenPose, a novel loss function that combines the periodic activation properties of sinusoidal representation networks with geometric priors derived from keypoint structures to improve the accuracy of dynamic 3D scene reconstruction. Existing approaches often struggle to maintain motion modeling accuracy and spatiotemporal consistency in fast moving and multi target scenes. By introducing physics inspired constraint mechanisms, SirenPose enforces coherent keypoint predictions across both spatial and temporal dimensions. We further expand the training dataset to 600,000 annotated instances to support robust learning. Experimental results demonstrate that models trained with SirenPose achieve significant improvements in spatiotemporal consistency metrics compared to prior methods, showing superior performance in handling rapid motion and complex scene changes.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
Topics3D Shape Modeling and Analysis · Advanced Vision and Imaging · Robotics and Sensor-Based Localization
