Cascaded and Generalizable Neural Radiance Fields for Fast View   Synthesis

Phong Nguyen-Ha; Lam Huynh; Esa Rahtu; Jiri Matas; Janne Heikkila

arXiv:2208.04717·cs.CV·November 22, 2023

Cascaded and Generalizable Neural Radiance Fields for Fast View Synthesis

Phong Nguyen-Ha, Lam Huynh, Esa Rahtu, Jiri Matas, Janne Heikkila

PDF

Open Access

TL;DR

CG-NeRF introduces a fast, generalizable neural radiance field approach with novel modules for efficient view synthesis, achieving high-quality results on unseen data using minimal training and a single GPU.

Contribution

The paper proposes a new architecture with a coarse predictor and convolutional renderer that enables fast, generalizable view synthesis without scene-specific training.

Findings

01

Outperforms state-of-the-art methods on synthetic and real datasets.

02

Achieves high-quality novel views with only photometric losses.

03

Maintains high-speed rendering using a single GPU.

Abstract

We present CG-NeRF, a cascade and generalizable neural radiance fields method for view synthesis. Recent generalizing view synthesis methods can render high-quality novel views using a set of nearby input views. However, the rendering speed is still slow due to the nature of uniformly-point sampling of neural radiance fields. Existing scene-specific methods can train and render novel views efficiently but can not generalize to unseen data. Our approach addresses the problems of fast and generalizing view synthesis by proposing two novel modules: a coarse radiance fields predictor and a convolutional-based neural renderer. This architecture infers consistent scene geometry based on the implicit neural fields and renders new views efficiently using a single GPU. We first train CG-NeRF on multiple 3D scenes of the DTU dataset, and the network can produce high-quality and accurate novel…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Vision and Imaging · Computer Graphics and Visualization Techniques · 3D Shape Modeling and Analysis

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings