TL;DR
DietNeRF enhances few-shot view synthesis by incorporating a semantic consistency loss using pre-trained encoders, enabling realistic rendering from minimal input views and improving 3D scene reconstruction quality.
Contribution
It introduces a semantic consistency loss into NeRF, allowing effective few-shot view synthesis with minimal input views by leveraging pre-trained visual encoders like CLIP.
Findings
Improves perceptual quality of few-shot view synthesis from scratch.
Enables rendering with as few as one input image when pre-trained.
Produces plausible completions of unobserved regions.
Abstract
We present DietNeRF, a 3D neural scene representation estimated from a few images. Neural Radiance Fields (NeRF) learn a continuous volumetric representation of a scene through multi-view consistency, and can be rendered from novel viewpoints by ray casting. While NeRF has an impressive ability to reconstruct geometry and fine details given many images, up to 100 for challenging 360{\deg} scenes, it often finds a degenerate solution to its image reconstruction objective when only a few input views are available. To improve few-shot quality, we propose DietNeRF. We introduce an auxiliary semantic consistency loss that encourages realistic renderings at novel poses. DietNeRF is trained on individual scenes to (1) correctly render given input views from the same pose, and (2) match high-level semantic attributes across different, random poses. Our semantic loss allows us to supervise…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsLinear Layer · Contrastive Language-Image Pre-training · Absolute Position Encodings · Position-Wise Feed-Forward Layer · Softmax · Dense Connections · Attention Is All You Need · Dropout · Residual Connection · Byte Pair Encoding
