Generative Lifting of Multiview to 3D from Unknown Pose: Wrapping NeRF inside Diffusion
Xin Yuan, Rana Hanocka, Michael Maire

TL;DR
This paper introduces a novel generative framework that jointly learns camera pose estimation and 3D scene reconstruction from unannotated images using a diffusion model, enabling NeRF creation without prior pose information.
Contribution
It presents a new end-to-end diffusion-based approach that simultaneously learns pose prediction and NeRF parameters from unannotated images, overcoming limitations of existing methods.
Findings
Successfully reconstructs NeRFs without pose annotations
Generates novel views from learned 3D models
Outperforms competing methods on challenging scenes
Abstract
We cast multiview reconstruction from unknown pose as a generative modeling problem. From a collection of unannotated 2D images of a scene, our approach simultaneously learns both a network to predict camera pose from 2D image input, as well as the parameters of a Neural Radiance Field (NeRF) for the 3D scene. To drive learning, we wrap both the pose prediction network and NeRF inside a Denoising Diffusion Probabilistic Model (DDPM) and train the system via the standard denoising objective. Our framework requires the system accomplish the task of denoising an input 2D image by predicting its pose and rendering the NeRF from that pose. Learning to denoise thus forces the system to concurrently learn the underlying 3D NeRF representation and a mapping from images to camera extrinsic parameters. To facilitate the latter, we design a custom network architecture to represent pose as a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · Computer Graphics and Visualization Techniques · Image Processing and 3D Reconstruction
MethodsDiffusion
