SV-GS: Sparse View 4D Reconstruction with Skeleton-Driven Gaussian Splatting
Jun-Jee Chao, Volkan Isler

TL;DR
SV-GS introduces a skeleton-driven Gaussian Splatting framework for 4D dynamic reconstruction from sparse, multi-view observations, outperforming existing methods and enabling practical real-world applications.
Contribution
The paper proposes a novel skeleton-guided deformation approach that reconstructs dynamic scenes from sparse data, relaxing the need for dense multi-view video inputs.
Findings
Outperforms existing methods by up to 34% in PSNR on synthetic datasets.
Achieves comparable results to dense monocular video methods with fewer frames.
Replaces initial static reconstruction with a diffusion prior for better practicality.
Abstract
Reconstructing a dynamic target moving over a large area is challenging. Standard approaches for dynamic object reconstruction require dense coverage in both the viewing space and the temporal dimension, typically relying on multi-view videos captured at each time step. However, such setups are only possible in constrained environments. In real-world scenarios, observations are often sparse over time and captured sparsely from diverse viewpoints (e.g., from security cameras), making dynamic reconstruction highly ill-posed. We present SV-GS, a framework that simultaneously estimates a deformation model and the object's motion over time under sparse observations. To initialize SV-GS, we leverage a rough skeleton graph and an initial static reconstruction as inputs to guide motion estimation. (Later, we show that this input requirement can be relaxed.) Our method optimizes a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
