SLS4D: Sparse Latent Space for 4D Novel View Synthesis

Qi-Yuan Feng; Hao-Xiang Chen; Qun-Ce Xu; Tai-Jiang Mu

arXiv:2312.09743·cs.CV·December 18, 2023·1 cites

SLS4D: Sparse Latent Space for 4D Novel View Synthesis

Qi-Yuan Feng, Hao-Xiang Chen, Qun-Ce Xu, Tai-Jiang Mu

PDF

Open Access

TL;DR

This paper introduces SLS4D, a sparse latent space approach for 4D novel view synthesis that efficiently models dynamic scenes with significantly fewer parameters than existing methods.

Contribution

The paper proposes a novel sparse latent space representation for 4D scenes, capturing global dynamics efficiently and reducing model complexity compared to prior dense grid-based methods.

Findings

01

Achieves state-of-the-art 4D view synthesis quality.

02

Uses only about 6% of parameters compared to recent methods.

03

Effectively models global scene dynamics with sparse representations.

Abstract

Neural radiance field (NeRF) has achieved great success in novel view synthesis and 3D representation for static scenarios. Existing dynamic NeRFs usually exploit a locally dense grid to fit the deformation field; however, they fail to capture the global dynamics and concomitantly yield models of heavy parameters. We observe that the 4D space is inherently sparse. Firstly, the deformation field is sparse in spatial but dense in temporal due to the continuity of of motion. Secondly, the radiance field is only valid on the surface of the underlying scene, usually occupying a small fraction of the whole space. We thus propose to represent the 4D scene using a learnable sparse latent space, a.k.a. SLS4D. Specifically, SLS4D first uses dense learnable time slot features to depict the temporal space, from which the deformation field is fitted with linear multi-layer perceptions (MLP) to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Vision and Imaging · 3D Shape Modeling and Analysis · Human Pose and Action Recognition