Free3D: Consistent Novel View Synthesis without 3D Representation

Chuanxia Zheng; Andrea Vedaldi

arXiv:2312.04551·cs.CV·April 2, 2024·1 cites

Free3D: Consistent Novel View Synthesis without 3D Representation

Chuanxia Zheng, Andrea Vedaldi

PDF

Open Access 1 Repo

TL;DR

Free3D presents a novel, efficient method for monocular novel view synthesis that enhances multi-view consistency without relying on explicit 3D representations, using a new pose encoding technique and lightweight attention layers.

Contribution

The paper introduces a new ray conditioning normalization layer and multi-view attention to improve view synthesis without 3D models, enabling better generalization and efficiency.

Findings

01

Achieves significant improvements over previous methods without 3D models.

02

Demonstrates strong generalization to new categories and datasets.

03

Maintains multi-view consistency with lightweight attention layers.

Abstract

We introduce Free3D, a simple accurate method for monocular open-set novel view synthesis (NVS). Similar to Zero-1-to-3, we start from a pre-trained 2D image generator for generalization, and fine-tune it for NVS. Compared to other works that took a similar approach, we obtain significant improvements without resorting to an explicit 3D representation, which is slow and memory-consuming, and without training an additional network for 3D reconstruction. Our key contribution is to improve the way the target camera pose is encoded in the network, which we do by introducing a new ray conditioning normalization (RCN) layer. The latter injects pose information in the underlying 2D image generator by telling each pixel its viewing direction. We further improve multi-view consistency by using light-weight multi-view attention layers and by sharing generation noise between the different views.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

lyndonzheng/Free3D
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Vision and Imaging · Robotics and Sensor-Based Localization · Human Pose and Action Recognition