TL;DR
This paper introduces Dynamic Gaussian Marbles, a novel method for novel view synthesis from monocular videos that overcomes the limitations of existing Gaussian-based methods by incorporating priors, hierarchical learning, and simplified Gaussian shapes.
Contribution
It extends Gaussian scene representations to casual monocular videos through three key modifications: isotropic Gaussian marbles, hierarchical learning, and priors including tracking, enabling effective 3D scene capture.
Findings
Outperforms Gaussian baselines in quality on dynamic scenes
Achieves comparable results to non-Gaussian methods
Maintains efficiency, editability, and tracking benefits
Abstract
Gaussian splatting has become a popular representation for novel-view synthesis, exhibiting clear strengths in efficiency, photometric quality, and compositional edibility. Following its success, many works have extended Gaussians to 4D, showing that dynamic Gaussians maintain these benefits while also tracking scene geometry far better than alternative representations. Yet, these methods assume dense multi-view videos as supervision. In this work, we are interested in extending the capability of Gaussian scene representations to casually captured monocular videos. We show that existing 4D Gaussian methods dramatically fail in this setup because the monocular setting is underconstrained. Building off this finding, we propose a method we call Dynamic Gaussian Marbles, which consist of three core modifications that target the difficulties of the monocular setting. First, we use isotropic…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsFocus
