OrbitNVS: Harnessing Video Diffusion Priors for Novel View Synthesis
Jinglin Liang, Zijian Zhou, Rui Huang, Shuangping Huang, and Yichen Gong

TL;DR
OrbitNVS introduces a novel approach to 3D view synthesis by reformulating it as an orbit video generation task, leveraging pre-trained video models and tailored strategies to improve geometry and appearance consistency from limited views.
Contribution
It adapts a pre-trained video generation model for novel view synthesis, incorporating camera control and normal map guidance to enhance 3D consistency and visual quality.
Findings
Outperforms previous methods on GSO and OmniObject3D benchmarks.
Achieves significant improvements in PSNR, e.g., +2.9 dB and +2.4 dB in single-view settings.
Effectively maintains geometry and appearance consistency in synthesized views.
Abstract
Novel View Synthesis (NVS) aims to generate unseen views of a 3D object given a limited number of known views. Existing methods often struggle to synthesize plausible views for unobserved regions, particularly under single-view input, and still face challenges in maintaining geometry- and appearance-consistency. To address these issues, we propose OrbitNVS, which reformulates NVS as an orbit video generation task. Through tailored model design and training strategies, we adapt a pre-trained video generation model to the NVS task, leveraging its rich visual priors to achieve high-quality view synthesis. Specifically, we incorporate camera adapters into the video model to enable accurate camera control. To enhance two key properties of 3D objects, geometry and appearance, we design a normal map generation branch and use normal map features to guide the synthesis of the target views via…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Advanced Vision and Imaging · Image Enhancement Techniques
