Towards Physically Consistent 4D Scene Reconstruction for Closed-loop Autonomous Driving Simulation
Bowyn Tan, Yutong Xie, Bai Huang, Fan Luo, Xiao Li, Naizheng Wang, Yang Guan, Shengbo Eben Li

TL;DR
This paper introduces a novel hierarchical training framework with temporal regularization to achieve physically consistent 4D scene reconstruction, enhancing closed-loop autonomous driving simulation.
Contribution
It proposes Orthogonal Projected Gradient (OPG) and a temporal regularization strategy to improve spatial-temporal disentanglement and scene consistency in 4D reconstructions.
Findings
Maintains stable novel-view synthesis in 4D scene reconstruction.
Outperforms existing methods in observation-reproducing metrics.
Ensures physically consistent scene evolution in closed-loop simulation.
Abstract
High-fidelity street scene reconstruction is pivotal for end-to-end autonomous driving simulation, where novel-view synthesis (NVS) and time-varying information modeling are two fundamental capabilities to facilitate closed-loop training. However, existing 3DGS methods and their 4D extensions fail to simultaneously achieve both. To bridge this gap, we establish an information-geometric diagnostic framework, revealing that this limitation stems from a credit assignment dilemma between spatial and temporal parameters. Specifically, the deterministic coupling between viewpoint and time in single-source observation creates a low-rank structure that induces massive null-space ambiguity between static view-dependent and dynamic time-varying components. Temporal information overshadows spatial cues, causing the estimation variance of spatial parameters to diverge. To address this issue, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
