Putting NeRF on a Diet: Semantically Consistent Few-Shot View Synthesis

Ajay Jain; Matthew Tancik; Pieter Abbeel

arXiv:2104.00677·cs.CV·April 2, 2021

TL;DR

DietNeRF adds a semantic consistency loss, computed with a pre-trained visual encoder, to NeRF training so that a scene learned from only a few input views still renders realistically from novel poses.

Contribution

The paper introduces an auxiliary semantic consistency loss for NeRF that uses pre-trained visual encoders such as CLIP to compare renderings across random poses, making view synthesis effective when only a few input views are available.

Findings

01

Improves the perceptual quality of few-shot view synthesis when the scene is learned from scratch.

02

Renders novel views from as few as one observed image when pre-trained on a multi-view dataset.

03

Produces plausible completions of unobserved regions.

Abstract

We present DietNeRF, a 3D neural scene representation estimated from a few images. Neural Radiance Fields (NeRF) learn a continuous volumetric representation of a scene through multi-view consistency, and can be rendered from novel viewpoints by ray casting. While NeRF has an impressive ability to reconstruct geometry and fine details given many images, up to 100 for challenging 360° scenes, it often finds a degenerate solution to its image reconstruction objective when only a few input views are available. To improve few-shot quality, we propose DietNeRF. We introduce an auxiliary semantic consistency loss that encourages realistic renderings at novel poses. DietNeRF is trained on individual scenes to (1) correctly render given input views from the same pose, and (2) match high-level semantic attributes across different, random poses. Our semantic loss allows us to supervise…
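
The two training objectives described in the abstract can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: it assumes the semantic term is one minus the cosine similarity between encoder embeddings of an observed view and of a rendering at a random pose, and that the two terms are combined with a scalar weight. The function names, the `sem_weight` value, and the use of plain NumPy vectors in place of real encoder outputs are all hypothetical.

```python
import numpy as np


def cosine_similarity(a, b, eps=1e-8):
    """Cosine similarity between two 1-D embedding vectors."""
    a = np.asarray(a, dtype=np.float64)
    b = np.asarray(b, dtype=np.float64)
    return float(np.dot(a, b) / (np.linalg.norm(a) * np.linalg.norm(b) + eps))


def semantic_consistency_loss(emb_observed, emb_rendered):
    """Assumed form of the semantic term: 1 - cosine similarity.

    emb_observed: embedding of a given input view (from a pre-trained encoder).
    emb_rendered: embedding of a rendering at a different, random pose.
    """
    return 1.0 - cosine_similarity(emb_observed, emb_rendered)


def reconstruction_loss(rendered_rgb, target_rgb):
    """Mean squared error between rendered and observed pixels at the same pose."""
    rendered = np.asarray(rendered_rgb, dtype=np.float64)
    target = np.asarray(target_rgb, dtype=np.float64)
    return float(np.mean((rendered - target) ** 2))


def diet_objective(rendered_rgb, target_rgb, emb_observed, emb_rendered,
                   sem_weight=0.1):
    """Hypothetical combined objective: pixel term plus weighted semantic term.

    sem_weight is a placeholder value chosen for illustration only.
    """
    pixel_term = reconstruction_loss(rendered_rgb, target_rgb)
    semantic_term = semantic_consistency_loss(emb_observed, emb_rendered)
    return pixel_term + sem_weight * semantic_term
```

In a real pipeline the embeddings would come from a frozen pre-trained image encoder applied to rendered and observed images, and gradients would flow back through the renderer. The sketch only shows how the pixel-level and semantic terms might be combined.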

Code & Models

Models

🤗 flax-community/putting-nerf-on-a-diet (model)

Taxonomy

Methods: Linear Layer · Contrastive Language-Image Pre-training · Absolute Position Encodings · Position-Wise Feed-Forward Layer · Softmax · Dense Connections · Attention Is All You Need · Dropout · Residual Connection · Byte Pair Encoding