4K4D: Real-Time 4D View Synthesis at 4K Resolution

Zhen Xu; Sida Peng; Haotong Lin; Guangzhao He; Jiaming Sun; Yujun Shen; Hujun Bao; Xiaowei Zhou

arXiv:2310.11448·cs.CV·October 31, 2023·6 cites

TL;DR

This paper introduces 4K4D, a novel 4D point cloud representation enabling real-time, high-quality 4K resolution view synthesis of dynamic scenes by leveraging hardware rasterization and a hybrid appearance model.

Contribution

We propose 4K4D, a 4D point cloud framework with a hybrid appearance model and differentiable depth peeling, achieving unprecedented rendering speed and quality for dynamic scene synthesis.

Findings

1. Over 400 FPS at 1080p resolution on DNA-Rendering dataset
2. 80 FPS at 4K resolution on ENeRF-Outdoor dataset
3. 30x faster rendering compared to previous methods

Abstract

This paper targets high-fidelity and real-time view synthesis of dynamic 3D scenes at 4K resolution. Recently, some methods on dynamic view synthesis have shown impressive rendering quality. However, their speed is still limited when rendering high-resolution images. To overcome this problem, we propose 4K4D, a 4D point cloud representation that supports hardware rasterization and enables unprecedented rendering speed. Our representation is built on a 4D feature grid so that the points are naturally regularized and can be robustly optimized. In addition, we design a novel hybrid appearance model that significantly boosts the rendering quality while preserving efficiency. Moreover, we develop a differentiable depth peeling algorithm to effectively learn the proposed model from RGB videos. Experiments show that our representation can be rendered at over 400 FPS on the DNA-Rendering…
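
The abstract is cut off above and does not spell out the depth peeling algorithm, so the following is only a minimal single-pixel sketch of the general technique (classic depth peeling followed by front-to-back alpha compositing), not the authors' differentiable GPU implementation. The function names `peel_layers` and `composite_peeled_layers`, and the `(depth, color, alpha)` fragment layout, are assumptions made for illustration.

```python
from typing import List, Sequence, Tuple

Color = Tuple[float, float, float]
Fragment = Tuple[float, Color, float]  # (depth, color, alpha): hypothetical layout


def peel_layers(fragments: Sequence[Fragment], num_passes: int) -> List[Fragment]:
    """Return up to num_passes fragments for one pixel, nearest first.

    Each pass keeps the nearest fragment that lies strictly behind the
    depth found by the previous pass, which is the core idea of depth peeling.
    """
    layers: List[Fragment] = []
    last_depth = float("-inf")
    for _ in range(num_passes):
        behind = [f for f in fragments if f[0] > last_depth]
        if not behind:
            break  # nothing left to peel for this pixel
        nearest = min(behind, key=lambda f: f[0])
        layers.append(nearest)
        last_depth = nearest[0]
    return layers


def composite_peeled_layers(colors: Sequence[Color], alphas: Sequence[float]) -> Color:
    """Blend peeled layers front to back with standard alpha compositing."""
    if len(colors) != len(alphas):
        raise ValueError("colors and alphas must have the same length")
    out = [0.0, 0.0, 0.0]
    transmittance = 1.0  # fraction of light not yet absorbed by nearer layers
    for color, alpha in zip(colors, alphas):
        weight = transmittance * alpha
        for c in range(3):
            out[c] += weight * color[c]
        transmittance *= 1.0 - alpha
    return (out[0], out[1], out[2])


if __name__ == "__main__":
    # Three unsorted fragments covering the same pixel.
    frags = [
        (2.0, (0.0, 1.0, 0.0), 0.5),
        (1.0, (1.0, 0.0, 0.0), 0.5),
        (3.0, (0.0, 0.0, 1.0), 1.0),
    ]
    layers = peel_layers(frags, num_passes=3)
    pixel = composite_peeled_layers([f[1] for f in layers], [f[2] for f in layers])
    print(pixel)
```

Because every step of the blend is a product or a sum of per-layer values, the same computation can be written in an autodiff framework so that gradients flow back to the per-point colors and opacities; this sketch stays in plain Python to keep the ordering logic easy to read.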

Code & Models

Models

🤗 dylanebert/4K4D · model · ♡ 7

Taxonomy

Topics: Advanced Vision and Imaging · Computer Graphics and Visualization Techniques · Robotics and Sensor-Based Localization

Methods: SPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings