GRVS: a Generalizable and Recurrent Approach to Monocular Dynamic View Synthesis
Thomas Tanay, Mohammed Brahimi, Michal Nazarczuk, Qingwen Zhang, Sibi Catley-Chandar, Arthur Moreau, Zhensong Zhang, Eduardo P\'erez-Pellitero

TL;DR
This paper introduces GRVS, a novel recurrent model for monocular dynamic view synthesis that disentangles camera and scene motion, outperforming existing scene-specific and diffusion-based methods on complex datasets.
Contribution
The paper presents a generalizable, recurrent approach with efficient plane sweep utilization for dynamic view synthesis, enabling fine-grained control and improved geometric reconstruction.
Findings
Outperforms scene-specific Gaussian Splatting approaches.
Achieves better geometric detail reconstruction in dynamic scenes.
Handles complex, high-resolution sequences with diverse scene dynamics.
Abstract
Synthesizing novel views from monocular videos of dynamic scenes remains a challenging problem. Scene-specific methods that optimize 4D representations with explicit motion priors often break down in highly dynamic regions where multi-view information is hard to exploit. Diffusion-based approaches that integrate camera control into large pre-trained models can produce visually plausible videos but frequently suffer from geometric inconsistencies across both static and dynamic areas. Both families of methods also require substantial computational resources. Building on the success of generalizable models for static novel view synthesis, we adapt the framework to dynamic inputs and propose a new model with two key components: (1) a recurrent loop that enables unbounded and asynchronous mapping between input and target videos and (2) an efficient use of plane sweeps over dynamic inputs to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
