A Construct-Optimize Approach to Sparse View Synthesis without Camera Pose
Kaiwen Jiang, Yang Fu, Mukund Varma T, Yash Belhe, Xiaolong Wang, Hao, Su, Ravi Ramamoorthi

TL;DR
This paper introduces a novel construct-and-optimize method for sparse view synthesis that does not rely on camera poses, leveraging 3D Gaussian splatting and monocular depth to improve quality from limited views.
Contribution
The paper proposes a new construct-and-optimize approach using Gaussian splatting and monocular depth to synthesize novel views without camera pose information, outperforming existing methods.
Findings
Achieves higher quality results with as few as three views.
Outperforms existing methods including InstantNGP and Gaussian Splatting.
Improves with more views, even with limited data.
Abstract
Novel view synthesis from a sparse set of input images is a challenging problem of great practical interest, especially when camera poses are absent or inaccurate. Direct optimization of camera poses and usage of estimated depths in neural radiance field algorithms usually do not produce good results because of the coupling between poses and depths, and inaccuracies in monocular depth estimation. In this paper, we leverage the recent 3D Gaussian splatting method to develop a novel construct-and-optimize method for sparse view synthesis without camera poses. Specifically, we construct a solution progressively by using monocular depth and projecting pixels back into the 3D world. During construction, we optimize the solution by detecting 2D correspondences between training views and the corresponding rendered images. We develop a unified differentiable pipeline for camera registration and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsSparse Evolutionary Training
